A clean, green science machine


As the world warms and technology improves, researchers and institutions should look at their 
carbon footprints and question whether they really need to travel to academic conferences. 


Every time the United Nations climate negotiations get under way, media stories appear
about the carbon emissions generated as thousands of government officials, environmentalists
and scientists fly in from around the world. Similar questions have been raised about major
environmental-science conferences, such as the Fall Meeting of the American Geophysical
Union (AGU), which last year drew an astounding 24,000 people. But rarely does this discussion
move beyond the obvious. It can indeed seem a bit disingenuous that people who are trying to
understand and protect the planet will engage in such a high-carbon activity as aviation, but
what is to be done?

Researchers at the Tyndall Centre for Climate Change Research in Norwich, UK, have published
a thoughtful working paper (see go.nature.com/zdzitd) that lays out a practical methodology to
tackle these questions, from the standpoint of both individual academics and institutions. The
bottom line is that aviation emissions are rising, and that academics in the industrialized
world are responsible for more than their fair share, thanks to the countless conferences, the
international nature of science and the need to do fieldwork in far-flung locales. If academics
are to fall into step, they must curb their aviation emissions in concert with the rest of the
world.

Academics in all disciplines, not just climate experts, should read the paper, not least
because it goes beyond posing moral questions and delves into solutions. Simply purchasing
carbon offsets will not balance things out, the authors argue, because the market for reliable
offsets may be limited and the net effect could be to legitimize unnecessary travel. If
scientists want to bolster their credibility on the subject of global warming, the authors say,
then they must harness the power of the Internet and reduce the time they spend in the air.
Indeed, the authors propose that a greater reliance on online conferencing and social media
will not just reduce emissions, it will make science more inclusive. The conferences that do
need to take place could be more centrally located for the target audience. And to keep track
of it all, the paper provides an accounting system that allows institutions to log “hours in
motion” and calculate the resulting emissions.

The arguments the authors put forward are powerful, and there are good reasons to pursue their
recommendations. In fact, some quarters have already come to similar conclusions and are taking
appropriate action: the AGU’s leaders, for instance, have started to look for ways to hold
parallel conferences and online dialogues that will allow more academics, often from developing
countries, to engage in its meetings. There is real value in face-to-face interaction, and we
must not forget that. But the Tyndall Centre is right to point out that senior researchers
probably do not need to fly halfway around the globe simply to present a paper at a conference.

In some ways, the working paper opens the door to questions that are even harder to answer. Do
scientists have a responsibility to stop eating meat, given what we know about the
greenhouse-gas intensity of beef production and to a lesser extent that of pork and chicken?
Should we expect them to park their cars and take the bus or train instead? The fact is that
these are personal choices that academics, like everybody else, must grapple with.

Nor does the buck stop with individuals. Institutions, academic or 
otherwise, have a large influence on the daily lives that their employees 
lead. It might seem extreme for universities to force vegetarian fare 
on their students (although many institutions now have a meat-free day), but bosses could go
a long way towards reducing traffic jams and carbon emissions by encouraging their employees
to telecommute. And in the long run, does it make sense for an institution or company to
purchase cheap real estate in the suburbs and then force its employees to bear the cost of
commuting?

There are plenty of solutions to be had at many levels, and all options 
may need to be exercised to stave off the worst impacts of global warm- 
ing. But lots of seemingly small actions can have an effect: last week, 
the International Energy Agency reported that global carbon emis- 
sions stalled in 2014, even as the global economy grew by 3%, appar- 
ently thanks in part to efforts by China and industrialized nations to 
boost energy efficiency and adopt renewable sources. 

Irrespective of what academics do, it seems likely that steaks, cars 
and planes will not only persist but will increase in number as the 
population booms and becomes more affluent. Ultimately, the world 
must identify better ways to raise beef and move people around. And 
scientists have a key role in making that happen, even if it means 
hopping on a flight to the next United Nations climate summit.


Hollow humour 


The public’s distorted attitude towards mental- 
health conditions hampers their treatment. 


What is so funny about obsessive-compulsive disorder (OCD)?
Paul Cefalu, a professor of English at Lafayette College in
Easton, Pennsylvania, asked that question in a 2009 article 
that analysed why OCD is so frequently, and unfairly, represented in the 
media with “humor and levity” (P. Cefalu PMLA http://doi.org/c2mtt4; 
2009). Unlike other mental-health disorders, such as depression and 
schizophrenia, those who suffer from obsessions and compulsions, 
Cefalu wrote, “can always be counted on to make us laugh”. 
The laughs are thinner on the ground in This Room, an auto- 
biographical one-woman play about OCD, written and performed 
by Laura Jane Dean. In a review on page 289, Emily Holmes calls it
“an affecting, sobering account of a life shaped by, yet transcending, a
mental-health condition and treatment”.

The portrayal of mental-health conditions (or, to be less semanti- 
cally guarded, mental illnesses) in the media and popular culture has a 
significant influence on the way that many people view both the 
conditions and those who have them. Next to lawyers and police officers, 
physicians are among the most frequently fictionalized professionals, 
and psychiatrists feature heavily. Despite the well-quoted statistic that 
one in four people will experience a mental illness at some point in their 
lives, the medical reality is something more easily viewed at a distance, 
peeking through the silver screen at the misery of somebody else. When 
that picture is distorted, the result is fear and mistrust, not just of the 
mental-health professionals but also of their patients. 

So why do Hollywood and the rest of popular culture often get 
psychiatric medicine so wrong? Why do we still talk of the psychiatrist's 
couch when most people who see a psychiatrist never sit on one? 

Too many accounts show only the illness, or at least a crude stereo- 
type of it, not the person. And, crucially, they do not show how the 
person can often improve with the right treatment. Miracle cures are 
as rare in mental as in physical medicine, but many (perhaps most) 
people with psychiatric problems who receive the right kind of help 
can start to feel better. It is important to acknowledge and portray this. 

A study published in February in the journal Social Science & Medicine
(E. E. McGinty et al. Soc. Sci. Med. http://doi.org/2v4; 2015), for
example, suggested that stories about people who were successfully 
treated for depression, schizophrenia and drug addiction reduced nega- 
tive attitudes towards mental illness in those who heard them. Tales of 
the same problems left untreated produced more willingness to dis- 
criminate against sufferers. This discrimination and prejudice is known 
to act as a barrier to people’s recovery, because they can experience 


social rejection when word of their condition gets around. 

Perhaps here is a way to tackle the stigma of mental illness. It is not
enough for scientists, journalists and campaigners to ‘raise awareness’
by highlighting the reality of such conditions, and by publicly criticizing
those who misuse the terms and language of psychiatric medicine. It
is not enough to point out that phrases such as ‘schizophrenic foreign
policy’ and ‘a little bit OCD’ are offensive because they misrepresent
and trivialize genuine and serious suffering. It is not enough to emphasize
the biological basis of mental illness. To truly change public attitudes,
the message must go out more often that this suffering can be alleviated.

In doing so, a powerful cultural myth must be challenged: that mental
illness is a gift and comes with benefits. From the supposed
enhanced creativity and meaningful visions of those with schizophre- 
nia to the claimed cognitive wizardry and insight of people with autism, 
mental-health conditions are too often presented as just another way of 
seeing the world. This sense of instant karma might soften the blow to 
audiences, to some patients even, but it feeds the damaging impression 
that psychiatrists are out to rob special people of their gifts and unique 
potential. It is hard enough for people with mental-health problems to 
seek help, without their fearing those who are best placed to provide it. 

What is so funny about OCD? The answer, Cefalu concluded, is the 
incongruity of the condition: the harder a sufferer tries to help them- 
selves with comforting rituals, the worse their torment becomes. The 
way to break the destructive cycle, to treat the OCD, is to resist the 
ritual, to stop playing to the crowd. That was what Laura Jane Dean 
managed to do. That was her treatment. That is her story. And stories 
do not have to be funny to have a happy ending.


Strike a chord 


The latest episode of the Nature Audiofile 
podcast looks at how music inspires science. 


“Mathematics and music! The most glaring possible opposites of human
thought! And yet connected, mutually sustained!” Thus enthused German
scientist Hermann von Helmholtz in an 1857 essay on harmony in music.

Today, mathematicians, ecologists and physicists search for 
harmony alongside musicians, if not always explicitly. But the link 
used to be even stronger. 

A typical academic curriculum circa 1600 — and for centuries 
before — blended music and (what we would call) science to a degree 
rarely seen in today’s undergraduate syllabi. The four subjects of the 
‘quadrivium’ were arithmetic, geometry, astronomy and music.

Today’s scientific training programmes tend to leave music out. 
They have changed in other ways too, of course, and for the better. 
Although some universities offer degrees in mathematics and music, 
or physics and music, modern scientists more often miss the chance 
to seek musical connections. 

“Many people think music is a charming accompaniment to
thought,” says musician and scientist Peter Pesic of St John’s College in
Santa Fe, New Mexico, in the latest episode of Nature’s sound-science
podcast, Audiofile (go.nature.com/xhluk3). “But developments in
music,” he notes, “influence other aspects of human thought.”

The podcast, one of a series, contains plenty of musical stories
from the history of science. Take the astronomer Johannes Kepler, who 
was preoccupied with the motions of the planets. He was desperate to 
find harmony that he felt sure existed in the way the Universe was set 
up. In 1619, he produced a giant volume called Harmonices Mundi,
or The Harmony of the World. In one particularly musical moment,
he expresses planetary motions in musical notation — the orbits of 
Mercury, Venus and their neighbours spun out into crotchets and 
quavers. You can hear the resulting song on the podcast. Without 
such explicitly musical thinking, it is possible that Kepler may never 
have arrived at his third law of planetary motion — the relationship 
between a planet's distance from the Sun and the time it takes to orbit. 
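
For readers who want the relationship spelled out, Kepler's third law can be sketched as follows. The symbols are notation chosen here for illustration (T for the orbital period, a for the semi-major axis of the orbit); they are not taken from the podcast, and the Mars figures are rounded textbook values.

```latex
% Kepler's third law: the square of the orbital period scales with the
% cube of the orbit's semi-major axis.
\[
  T^{2} \propto a^{3}
  \qquad\Longleftrightarrow\qquad
  \frac{T_{1}^{2}}{T_{2}^{2}} = \frac{a_{1}^{3}}{a_{2}^{3}}
\]
% Worked check in units where Earth has a = 1 AU and T = 1 yr:
% for Mars, a \approx 1.524 AU, so
\[
  T_{\text{Mars}} \approx (1.524)^{3/2}\ \text{yr} \approx 1.88\ \text{yr}
\]
```

The second line reproduces the observed Martian year of roughly 1.88 Earth years, which is the kind of numerical harmony Kepler was looking for.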

Then there is the tantalizing suggestion that Galileo Galilei’s 
musical father might have influenced the way his son thought about 
science. In the late 1580s, Vincenzo Galilei carried out an experiment 
on the sounds made by strings held at different tensions. Vincenzo’s 
home-made experiments could well have instilled in his son the idea 
of looking at a physical system to produce a hypothesis, rather than 
retrofitting the one to the other. 

Musical analogies continue to help scientists to make sense of tricky 
concepts. String theorists speak of tiny vibrating strings instead of 
point-like particles. The comparison with stringed instruments is easy 
to see; the strings represent elementary particles in the same way that 
the strings on a guitar make different notes. 

Musical inspiration can often remain buried under the surface of 
scientific work; there might not seem to be an immediate harmony 
between, say, a genome-wide association study and one of Erik Satie’s 
études. But if music remains apart from the sciences, there is much we 
might lose. Anecdotally, neuroscientists talk of musical theory aiding 
the analysis of brain patterns. And a theory of strings is more intelligible 
to some than a theory of tiny, massless, vibrating subatomic squiggles. 

Looking too hard for harmony might be misleading if the real picture 
turns out to be more discordant. But, as explained in the podcast by Jim 
Bennett, emeritus director of the Museum of the History of Science in 
Oxford, UK, music has already provided a great 
template for interpreting our surroundings: 
“The insight, which plausibly came from music, 
that the world has a mathematical blueprint is 
fundamental to science.”


could catch. More than 300 such conditions were discovered in

the second half of the twentieth century alone. And how many of 
these diseases can scientists and clinicians protect against with a licensed
vaccine? Fewer than 30. 

Those are not always the biggest killers, or the most terrifying. Vaccine 
development is driven not by the risk that a pathogen poses to people, 
but by the economic pay-off. Given the difficulty of the science involved, 
how much money will it take to develop the vaccine? And given the size 
of the market, how much money can we make by selling it? 

That helps to explain why, more than a year on from the first con- 
firmed cases of the ongoing Ebola outbreak in West Africa, no vaccine 
is available, even though work started towards one 
more than a decade ago. Phase III trials for two 
vaccines have now been launched in Liberia and 
Guinea, and we have great hope for them, assum- 
ing that there are still enough cases developing to 
test the vaccines for efficacy. But for the more than 
10,000 people who have lost their lives, and count- 
less others who have suffered and will continue to 
suffer, these trials have come too late. 

Our inability to protect people against Ebola is 
part of what makes the disease so frightening. In 
most cases, it is not what a disease is capable of 
that scares us, but that we can do so little about it. 

But why is this the case for Ebola? We have 
known about the disease since 1976, and the first 
vaccine candidate was developed more than a dec- 
ade ago. Ebola is not hypervariable like influenza 
or HIV, constantly changing and finding new ways 
to evade our immune systems, so we have had 
ample time to develop a vaccine or effective treatment during any one 
of the previous 23 outbreaks. Why were we caught by surprise this time? 

The short answer is that we were not, but that the development of 
a vaccine was considered too financially risky. With a disease such as 
Ebola, which kills ferociously but occurs sporadically and usually in 
remote areas, there is simply no commercial market. Who would buy 
it? Outbreaks usually involve only a couple of hundred cases and occur 
every few years in poor rural communities in Africa. This leaves little in 
the way of incentives for manufacturers to invest the hundreds of mil- 
lions of dollars it takes to develop a vaccine and get it clinically approved. 

It is childish to blame the drug industry for failing to develop an Ebola 
vaccine — a product with no market. Instead, governments, public 
funders and private donors should be stepping up and investing. 

We must work on a strategy that allows meaningful quantities of proven vaccines to be quickly
produced and distributed when an outbreak occurs, whether of Ebola or of other infectious
diseases.


Share the risks of Ebola 
vaccine development 


Ebola vaccines have little in the way of commercial markets, so the risks 
should be shared between governments and industry, says Seth Berkley. 


That demands better disease surveillance. More and better-equipped
laboratories, as well as trained epidemiologists, in developing countries
would improve our ability to quickly detect and investigate outbreaks of
commonly occurring diseases, as well as new threats.

The vast amount of data produced by this kind of surveillance 
network would have an added bonus. With the right smart data-mining 
algorithms, the information could be used to radically increase our 
understanding of how pathogens travel and mutate, and then how our 
immune systems respond to these changes. 

When an outbreak occurs and vaccines are needed, it would help 
significantly to have vectors ready to deliver them. With the right invest- 
ment, these vectors, typically a harmless virus or bacterium, could be 
prepared and tested in advance. Crucially, they 
could be pressed into service to tackle a range of 
diseases. Four of the five Ebola vaccines currently 
going through clinical trials use vectors developed 
and tested for HIV. 

Such generic vectors would, in effect, 
modularize the vaccine development pro- 
cess — conducting much of the safety testing and 
ironing out manufacturing processes for different 
vectors ready for the addition of a ‘payload’ anti- 
gen. By developing such mechanisms in advance, 
and pre-testing them for safety and dose, we can 
save significant amounts of money and time by 
having stockpiles frozen and ready for use or effi- 
cacy testing as soon as an outbreak occurs. 

This is similar to the way in which technology 
developed using public funds through NASA 
has reduced the cost of placing scientific probes, 
telescopes and satellites into space. Same rocket, 
different payload. 

It demands a different attitude to disease control. We need to stop 
waiting until we see evidence of a disease becoming a global threat 
before we treat it like one. Vaccine development is expensive, but the 
United States currently spends at least US$11 billion a year to keep fleets 
of nuclear-armed submarines patrolling the oceans to protect people 
from a threat that will almost certainly never happen. That is 60 times 
more than the World Health Organization puts into global disease 
preparedness. 
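
The comparison above implies a figure for World Health Organization preparedness spending that the text does not state. A minimal sketch of that arithmetic follows, assuming that "60 times more" is meant as a simple ratio and taking the "at least US$11 billion" as the numerator; the function and constant names are illustrative, and the result is a rough implied value, not an official budget line.

```python
# Figures quoted in the text. The US figure is stated as "at least", so the
# derived value is a rough implied number, not an official WHO budget.
US_SUBMARINE_SPEND_USD = 11e9   # "at least US$11 billion a year"
RATIO_TO_WHO_SPEND = 60         # "60 times more than" WHO preparedness spending


def implied_who_preparedness_spend(us_spend_usd: float, ratio: float) -> float:
    """Return the WHO spend implied by reading '60 times more' as a plain ratio."""
    return us_spend_usd / ratio


if __name__ == "__main__":
    implied = implied_who_preparedness_spend(US_SUBMARINE_SPEND_USD, RATIO_TO_WHO_SPEND)
    print(f"Implied WHO preparedness spend: about US${implied / 1e6:.0f} million a year")
```

On these assumptions the implied figure is a little over US$180 million a year, which is the scale of investment the argument is contrasting with defence spending.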

Governments and donors need to invest in public-health capability, 
and they need to take on more of the risk of investing in vaccine develop- 
ment. We must view vaccines as the ultimate deterrent: make sure they 
are there, and pray that we never have to use them.


Seth Berkley is chief executive of Gavi, the Vaccine Alliance, based in 
Geneva, Switzerland. This article is based on a TED talk delivered in 
Vancouver, Canada, on 18 March. 

e-mail: sberkley@gavi.org 


19 MARCH 2015 | VOL 519 | NATURE | 263 


© 2015 Macmillan Publishers Limited. All rights reserved 


RESEARCH HIGHLIGHTS
Selections from the scientific literature


Methane’s great Arctic escape

Methane is moving from thawing Arctic soils into lakes and could be released into the
atmosphere. Methane is a potent greenhouse gas, so this mechanism might exacerbate future
Arctic warming.

A team led by Adina Paytan of the University of California, Santa Cruz, found that levels of
methane were higher in soils around Toolik Lake, Alaska, than in the lake water itself.
Geochemical measurements suggested that the methane is transported from the soil’s active
layer, which freezes and thaws every year, into the lake and then into the atmosphere.

If that pattern holds true for other northern lakes, soil could have a bigger role in sending
methane into Arctic air than previously thought.
Proc. Natl Acad. Sci. USA http://doi.org/2sh (2015)


Nanoparticles turn on neurons

Gold nanoparticles can be attached to neurons and used to stimulate the cells, without
introducing any genes.

Current ‘optogenetic’ methods use light to excite specific neurons, but genes must first be
inserted into the cells to make them sensitive to light. To develop an alternative method,
Francisco Bezanilla at the University of Chicago in Illinois, David Pepperberg at the
University of Illinois at Chicago and their colleagues used molecules including antibodies to
attach 20-nanometre-wide gold spheres to three different ion channels on the surface of
cultured neurons. When the researchers flashed a millisecond pulse of light, the gold heated
up, causing most of the neurons to fire. The same thing happened when they injected the
nanoparticles into a specific region of a mouse brain slice.
Neuron http://doi.org/2sj (2015)


Flowers choose the best pollinators

A tropical flower can turn on reproduction after it has been visited by a high-quality
pollinator.

Matthew Betts of Oregon State University in Corvallis and his colleagues focused on the plant
Heliconia tortuosa (pictured) and collected 148 of its pollinators, comprising six hummingbird
species and one species of butterfly. The animals were cleaned of any pollen and introduced to
aviaries containing flowers that had been hand-pollinated. The plants showed signs of
successful reproduction only after their nectar had been drunk by hummingbird species with
long curved beaks, such as the green hermit (Phaethornis guy; pictured). Hummingbirds without
the specialized bills and butterflies took in less nectar and failed to trigger reproduction.
Preferred birds also have the widest ranges, suggesting that the plants are boosting their
chances of receiving pollen from distant flowers with more genetic diversity than nearby plants.
Proc. Natl Acad. Sci. USA http://doi.org/2sf (2015)


Liquid metal motor moves by itself

A tiny drop of liquid metal can propel itself for more than an hour without external help.

Millimetre-scale motors could find uses as sensors, pumps and drug carriers, but they often
require external drivers such as electric fields. Jing Liu and his colleagues at Tsinghua
University in Beijing created a 60-microlitre liquid-metal motor that drove itself at around
5 centimetres per second by ‘eating’ aluminium. The team applied flakes of aluminium to
droplets of an alloy of gallium and indium. A chemical reaction between the aluminium, the
alloy and a surrounding electrolyte propelled the metal beads around a Petri dish or through
zig-zag and U-shaped channels.

The authors say that the work is a step towards creating a self-powered soft robot that can
change shape according to its environment.
Adv. Mater. http://doi.org/f26cb6 (2015)


Self-cleaning 
paint works in oil 


A coating that can be easily 
applied to various surfaces 
repels water and dirt, even 
when exposed to oil. 

Other water-repellent paints 
stop working in oil and are
easily scratched. To overcome
this, Ivan Parkin at University 
College London and his 
colleagues covered titanium 
dioxide nanoparticles with 
a hydrophobic polymer and
suspended the particles in 
ethanol. 

They sprayed or painted the 
suspension onto hard surfaces 
such as glass and steel, and 
dipped soft fabric materials 
into it. The coating repelled 
water and dirt, and did so 
even after being exposed to 
oil. Bonding the coating to 
surfaces using commercial 
adhesives made the film 
resistant to scratches from a
knife and sandpaper. 

The material could be useful 
in industrial applications 
that involve harsh and oily 
environments, the authors say. 
Science 347, 1132-1135 (2015) 


Metal framework 
zaps nerve agents 


A crystalline compound 
catalyses the destruction of a 
nerve agent much faster than 
other clean-up chemicals do. 
Omar Farha and Joseph 
Hupp at Northwestern 
University in Evanston, 
Illinois, and their colleagues 
studied a metal-organic 
framework (MOF), a
porous network of metal 
nodes linked by organic 
groups. They found that their 
zirconium-containing MOF 
(pictured) broke down half 
of a simulant of the chemical 
warfare agent DMNP in 
15 minutes. Breakdown of 
50% of the nerve agent GD 
took just 3 minutes. 
Experiments and 
calculations showed that 
this MOF performs 
quickly because the 
zirconium ion active 
sites, which catalyse 
the breakdown 
reactions, are 
more easily 
reached by the 
nerve agents than 
in other MOFs. 
Nature Mater. http://dx.doi.org/10.1038/nmat4238 (2015)
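
The two breakdown times quoted above are half-lives, so they can be turned into comparable rate constants. The sketch below assumes simple first-order decay, which the highlight does not state; the function names are illustrative and the numbers are only those given in the text (15 minutes for DMNP, 3 minutes for GD).

```python
import math

# Half-lives quoted in the highlight, in minutes.
HALF_LIFE_DMNP_MIN = 15.0
HALF_LIFE_GD_MIN = 3.0


def first_order_rate_constant(half_life_min: float) -> float:
    """Rate constant k (per minute), ASSUMING first-order decay: k = ln(2) / t_half."""
    return math.log(2) / half_life_min


def fraction_remaining(half_life_min: float, elapsed_min: float) -> float:
    """Fraction of agent left after elapsed_min under the same assumption."""
    k = first_order_rate_constant(half_life_min)
    return math.exp(-k * elapsed_min)


if __name__ == "__main__":
    for name, t_half in (("DMNP", HALF_LIFE_DMNP_MIN), ("GD", HALF_LIFE_GD_MIN)):
        k = first_order_rate_constant(t_half)
        left = fraction_remaining(t_half, 15.0)
        print(f"{name}: k = {k:.3f} per min, {left:.1%} left after 15 min")
```

Under this assumption a five-fold shorter half-life means a five-fold larger rate constant, and after 15 minutes about 3% of the GD would remain compared with half of the DMNP.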


MARINE MICROBIOLOGY 


Microbes lurk 
deep below the sea 


Microbial life may exist far 
deeper in the ocean floor than 
is often assumed. 

Steven D’Hondt at the 
University of Rhode Island in 
Narragansett and his colleagues 
sampled sediments across 
the southern Pacific Ocean. 
They found that oxygen, 
and microbes that require it, 
permeated depths of up to 
75 metres below the sea floor 
— more than double previous 
estimates. 

The team found that oxygen 
penetrates the entire sediment 
column where the sediment 
accumulates slowly in a shallow 
layer. On this basis, the authors 
estimate that microbes that 
use oxygen may exist at low, 
but measurable, amounts 
throughout sediment in around 
15-44% of the Pacific and in 
9-37% of the global sea floor. 
Nature Geosci. http://dx.doi.org/10.1038/ngeo2387 (2015)


Milky Way has 
corrugated rings 


The Milky Way’s stars sprawl 
outwards in a series of 
concentric ripples, hinting that 
it might extend farther into 
space than was thought. 
Data from the Sloan 
Digital Sky Survey confirm a 
previously known ring of stars 
at about 9,000 parsecs from the 
Sun. They also show another 
ring about 14,000 parsecs 
from the Sun, says a team led 
by Yan Xu of the National 
Astronomical Observatories 
of China in Beijing. These 
rings each form a ripple, 
making our Galaxy 
corrugated rather 
than flat. 
The ripples 
may have 
formed when 
a dwarf galaxy 
passed through the 
Milky Way, creating 
rings of stars with its 
gravitational pull. 
Astrophys. J. 801, 105 (2015)


‘Science fandom’ can hurt science


Research stories that go viral on social media can bring
science to a wider audience. But there is a downside to this
‘science fandom’, argues writer Ben Thomas in an essay on
the Medium website that triggered discussion online (see
go.nature.com/k9vwqj). Much of what gets shared lacks the
nuance and uncertainty of science, a gloss that Thomas
dubs “scienceyness”. He writes that sharing the latest sciencey
headlines without any critical thought or fact-checking,
whether by scientists or non-scientists, is contributing to
an “onslaught of misinformation”. Some on social media
thought the blame was misplaced. Picking on the consumer
who may not have science training “is a little unfair”, says
Lindsay Waldrop, a mathematical biologist at the University
of North Carolina in Chapel Hill, who commented on
the article on Twitter. Others suggested an upside to
scienceyness. Paul Coxon, a materials scientist at the
University of Cambridge, UK, tweeted: “‘Scienceyness’ isn’t bad.
It’s often a way for excluded groups to get involved.”

For more on popular papers: go.nature.com/pemgxc


BIOPHYSICS


Chameleons tune cells to change hue

Chameleons change colour by tuning nanoscopic structures in their skin cells to reflect
different wavelengths of light.

Michel Milinkovitch and his colleagues at the University of Geneva, Switzerland, studied skin
cells of the panther chameleon (Furcifer pardalis) from Madagascar. They found that the
lizards have two layers of specialized cells called iridophores. Each layer contains
light-reflecting guanine nanocrystals. By altering the spacing between the crystals in the
upper layer, the cells shift from reflecting blue light to reflecting yellow or red
wavelengths, which interact with the chameleon’s yellow pigments. This produces a change in
colour from green (pictured, left) to yellow-orange (right). The deeper layer consists of
cells that reflect a broad set of wavelengths, particularly those in the near-infrared range.
The first layer of cells allows the animals to quickly switch between camouflage and an
ostentatious display to attract mates or expel a rival male, whereas the second layer provides
thermal protection.
Nature Commun. 6, 6368 (2015)


CORRECTION

In the Research Highlight ‘X-rays reveal virus innards’ (Nature 519, 132-133; 2015), the image
was described as showing the virus. In fact, it shows the X-ray diffraction patterns of the
virus.

For the latest research published by Nature visit: www.nature.com/latestresearch


ARTICLE


doi:10.1038/nature14327 


Cooperative insertion of CO₂ in diamine-appended metal-organic frameworks


Thomas M. McDonald, Jarad A. Mason, Xueqian Kong, Eric D. Bloch, David Gygi, Alessandro Dani, Valentina Crocella,
Filippo Giordanino, Samuel O. Odoh, Walter S. Drisdell, Bess Vlaisavljevich, Allison L. Dzubak, Roberta Poloni,
Sondre K. Schnell, Nora Planas, Kyuho Lee, Tod Pascal, Liwen F. Wan, David Prendergast, Jeffrey B. Neaton,
Berend Smit, Jeffrey B. Kortright, Laura Gagliardi, Silvia Bordiga, Jeffrey A. Reimer & Jeffrey R. Long


The process of carbon capture and sequestration has been proposed as a method of mitigating the build-up of greenhouse
gases in the atmosphere. If implemented, the cost of electricity generated by a fossil fuel-burning power plant would rise
substantially, owing to the expense of removing CO₂ from the effluent stream. There is therefore an urgent need for more
efficient gas separation technologies, such as those potentially offered by advanced solid adsorbents. Here we show that
diamine-appended metal-organic frameworks can behave as ‘phase-change’ adsorbents, with unusual step-shaped
CO₂ adsorption isotherms that shift markedly with temperature. Results from spectroscopic, diffraction and
computational studies show that the origin of the sharp adsorption step is an unprecedented cooperative process in which,
above a metal-dependent threshold pressure, CO₂ molecules insert into metal-amine bonds, inducing a reorganization of
the amines into well-ordered chains of ammonium carbamate. As a consequence, large CO₂ separation capacities can be
achieved with small temperature swings, and regeneration energies appreciably lower than achievable with state-of-the-art
aqueous amine solutions become feasible. The results provide a mechanistic framework for designing highly efficient
adsorbents for removing CO₂ from various gas mixtures, and yield insights into the conservation of Mg²⁺ within the
ribulose-1,5-bisphosphate carboxylase/oxygenase family of enzymes.


Exceeding 13 gigatonnes (Gt) annually, carbon dioxide generated from
the combustion of fossil fuels for the production of heat and electricity
is a major contributor to climate change and ocean acidification. Implementation
of carbon capture and sequestration technologies has been
proposed as a means of enabling the continued use of fossil fuels in the
short term, while renewable energy sources gradually replace our existing
infrastructure. The removal of CO2 from low-pressure flue gas mixtures
is currently effected by aqueous amine solutions that are highly
selective for acid gases. As a result of the large energy penalty for desorbing
CO2 from such liquids, solid adsorbents with appreciably lower
heat capacities are frequently proposed as promising alternatives. In
particular, as a result of their high surface areas and tunable pore chemistry,
the separation capabilities of certain metal-organic frameworks
have been shown to meet or exceed those achievable by zeolite or carbon
adsorbents.

Recently, the attachment of alkyldiamines to coordinatively unsaturated
metal sites lining the pores of selected metal-organic frameworks
has been demonstrated as a simple methodology for increasing low-pressure
CO2 adsorption selectivity and capacity. Most notably,
functionalization of Mg2(dobpdc) (dobpdc4− = 4,4'-dioxidobiphenyl-3,3'-dicarboxylate),
an expanded variant of the well-studied metal-organic
framework Mg2(dobdc) (dobdc4− = 2,5-dioxidobenzene-1,4-dicarboxylate),
with N,N'-dimethylethylenediamine (mmen) generated an
adsorbent with exceptional CO2 capacity under flue gas conditions and
unusual, unexplained step-shaped adsorption isotherms. Here we
elucidate the unprecedented mechanism giving rise to these step-shaped
isotherms and demonstrate that replacing Mg2+ with other divalent
metal ions enables the position of the CO2 adsorption step to be manipulated
in accord with the metal-amine bond strength. As we will
show, the resulting mmen-M2(dobpdc) (M = Mg, Mn, Fe, Co, Zn) compounds,
here designated ‘phase-change’ adsorbents, can have highly
desirable characteristics that make them superior to other solid or liquid
sorbents for the efficient capture of CO2.

Figure 1 illustrates the extraordinary advantages associated with using
an adsorbent exhibiting step-shaped isotherms in a temperature swing
adsorption process in comparison with the Langmuir-type isotherms
observed for most microporous adsorbents. For carbon capture applications,
a gas mixture containing CO2 at low pressure (Pads) and low
temperature (Tlow) is contacted with the adsorbent, which selectively
adsorbs a large amount of CO2. The adsorbent is heated to liberate pure
CO2 with a partial pressure of Pdes, and is then reused for subsequent
adsorption-desorption cycles. For a classical adsorbent (Fig. 1a), including
all previous amine-based sorbents, the steepness of the isotherm
gradually diminishes as the temperature increases, necessitating a high
desorption temperature to achieve a large working capacity for a separation.
In contrast, for a phase-change adsorbent of the type investigated
here (Fig. 1b), the position of the isotherm step shifts markedly to higher
pressures as the temperature increases, such that a large working capacity
can be achieved with only a small increase in temperature. For an
efficient carbon capture process, one would ideally create a phase-change
adsorbent with a large vertical step positioned below the partial pressure
of CO2 in the flue gas.

Figure 1 | Idealized CO2 adsorption isotherms. Variation in the idealized
adsorption isotherm behaviour with temperature for a classical microporous
adsorbent (a), showing the usual Langmuir-type isotherm shape, compared
with that of a phase-change adsorbent (b), showing step-shaped (sometimes
referred to as ‘S-shaped’) isotherms. The double-headed black arrow indicates
the working capacity (that is, the amount of gas removed) for a separation
performed using a temperature swing adsorption process in which selective
adsorption occurs at Pads and Tlow and desorption is performed at Pdes and
Thigh (a) or Tmedium (b).
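The contrast drawn in Fig. 1 can be made concrete with a small numerical sketch. The code below is illustrative only: it models the classical adsorbent with a Langmuir isotherm and the phase-change adsorbent with a Hill-type step whose position follows an exponential temperature dependence. Every parameter value (saturation loading, enthalpy, reference step pressure, Hill exponent) is a hypothetical placeholder rather than a fitted value from this work; only the 0.15 bar adsorption pressure and the 100 to 150 °C swing echo conditions mentioned in the text.

```python
import math

R = 8.314  # gas constant, J mol^-1 K^-1

# Hypothetical parameters, chosen only to reproduce the qualitative shapes in Fig. 1.
Q_MAX = 3.5      # saturation loading, mmol g^-1
DH_ADS = -70e3   # adsorption enthalpy, J mol^-1 (negative means exothermic)
T_REF = 373.15   # reference temperature, K


def langmuir_loading(p_bar, t_kelvin, k_ref=5.0):
    """Langmuir loading; the affinity K weakens on heating (van 't Hoff form)."""
    k = k_ref * math.exp(-DH_ADS / R * (1.0 / t_kelvin - 1.0 / T_REF))
    return Q_MAX * k * p_bar / (1.0 + k * p_bar)


def step_loading(p_bar, t_kelvin, p_step_ref=0.10, n_hill=10.0):
    """Hill-type step isotherm; the step pressure rises on heating."""
    p_step = p_step_ref * math.exp(DH_ADS / R * (1.0 / t_kelvin - 1.0 / T_REF))
    x = (p_bar / p_step) ** n_hill
    return Q_MAX * x / (1.0 + x)


def working_capacity(isotherm, p_ads, t_low, p_des, t_high):
    """Loading at adsorption conditions minus loading left at desorption conditions."""
    return isotherm(p_ads, t_low) - isotherm(p_des, t_high)


# Adsorb from 0.15 bar CO2 at 100 C, desorb into 1 bar CO2 at 150 C.
wc_langmuir = working_capacity(langmuir_loading, 0.15, 373.15, 1.0, 423.15)
wc_step = working_capacity(step_loading, 0.15, 373.15, 1.0, 423.15)
print(f"Langmuir: {wc_langmuir:.2f} mmol/g, step-shaped: {wc_step:.2f} mmol/g")
```

With the same enthalpy and the same 50 °C swing, the step-shaped model releases most of its loading while the Langmuir model releases only a small fraction, which is the qualitative point of the figure.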


Cooperative insertion of CO2 into metal-amine bonds

Spectroscopic and diffraction measurements were undertaken to determine
the mechanism of CO2 uptake leading to a steep adsorption
step for adsorbents such as mmen-Mg2(dobpdc). In particular, powder
X-ray diffraction studies, which were performed on the isostructural compound
mmen-Mn2(dobpdc) owing to the greater crystallinity of its base
framework, provided detailed structural information on how CO2 binds
within the channels of the material. Diffraction data collected at 100 K
before and after exposure of a sample to 5 mbar of CO2 showed the unit
cell volume contracting by just 1.112(8)%, but revealed large changes
in the relative intensity of selected diffraction peaks (Fig. 2a). Complete
structural models were developed for both data sets using the simulated
annealing method, as implemented in TOPAS-Academic, followed by
Rietveld refinement against the data (Fig. 3, Extended Data Fig. 1 and
Supplementary Tables 1-4). Before exposure to CO2, the mmen molecules
were bound through one amine group to the Mn2+ sites with a
Mn-N distance of 2.29(6) Å, whereas the other amine lay exposed on the
surface of the framework (Fig. 3c). Counter to our initial assumption
that the uncoordinated amine groups would serve to bind CO2 (refs 13, 20),
CO2 adsorption instead occurred by means of full insertion into the
Mn-N bond, resulting in a carbamate with one O atom bound to Mn at
a distance of 2.10(2) Å (Fig. 3d). The second O atom of the carbamate
had a close interaction of 2.61(9) Å with the N atom of a neighbouring
mmen, resulting in chains of ammonium carbamate running along the
crystallographic c axis of the structure (Fig. 3e). The observed ammonium
carbamate N···O distance was similar to the distance of 2.66-2.72 Å
in a single crystal of pure mmen-CO2 (methyl(2-(methylammonio)ethyl)carbamate).
This well-ordered chain structure was maintained
at 295 K, as determined from a full Rietveld refinement against data collected
at this temperature. Thus, the adsorption of CO2 at ambient temperatures
is associated with a structural transition to form an extended
chain structure held together by ion pairing between the metal-bound
carbamate units and the outstretched ammonium group of a neighbouring
mmen molecule.

Figure 2 | Experimental characterization of the adsorption mechanism.
a, Large intensity differences are apparent in the powder X-ray diffraction
patterns (collected at 100 K) on exposure of mmen-Mn2(dobpdc) (blue) to
5 mbar CO2 (green). b, Infrared spectra on dosing an activated sample of
mmen-Mg2(dobpdc) (black) with CO2 and cooling from 150 °C to 30 °C
(red to blue) under 5% CO2 in N2. The three different regions show bands
corresponding to N-H (left), C-O (centre) and C-N (right) stretching
vibrations. Spectra in the left panel are artificially offset by 0.05 a.u. to aid in
visualization. Those in the other two panels are not offset; there, CO2 adsorption
is responsible for the increase in the spectral baseline due to molecular
charge delocalization of the ammonium carbamate chains. c, Experimental
(left) and computational (right) NEXAFS spectra of mmen-Mg2(dobpdc) at the
N K-edge, before (blue) and after (green) CO2 adsorption; the three major
spectral changes are reproduced. d, Solid-state 15N NMR spectra for
mmen-Mg2(dobpdc) before (blue) and after (green) exposure to CO2 at 25 °C.

The foregoing structural information enabled the formulation of a
detailed mechanism for the adsorption of CO2 in phase-change adsorbents
of the type mmen-M2(dobpdc). As shown in Fig. 4, the uncoordinated
amine of a mmen molecule acts as a strong base to remove the
acidic proton from the metal-bound amine of a neighbouring mmen
molecule. Deprotonation occurs only in the presence of CO2, such that
simultaneous nucleophilic addition of CO2 results in the formation of a
carbamate with an associated ammonium countercation. At suitable temperatures
and pressures, rearrangement of the carbamate is possible such
that the M-N bond is broken and a M-O bond is formed. Critically, the
ion-pairing interaction causes the mmen molecule to stretch, destabilizing
the M-N bond and facilitating insertion at the next metal site. This
cooperative effect will propagate until a complete one-dimensional ammonium
carbamate chain has formed. Indeed, it is this cooperativity
that leads to the sudden uptake of a large amount of CO2 and a steep
vertical step in the adsorption isotherm.
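How a favourable interaction between neighbouring sites along a chain sharpens an isotherm can be illustrated with a toy model. The sketch below is not the mechanism or the simulation used in this work: it is the textbook one-dimensional lattice gas with nearest-neighbour attraction, solved exactly through its mapping onto the Ising chain. The names `beta_mu` (chemical potential plus site binding energy, in units of kT) and `beta_eps` (neighbour attraction in units of kT) are assumptions introduced for illustration. For an ideal gas, a change in `beta_mu` equals a change in ln P, so the slope of ln(θ/(1−θ)) against `beta_mu` plays the role of a Hill coefficient.

```python
import math


def chain_coverage(beta_mu, beta_eps):
    """Exact coverage of an infinite 1D lattice gas with nearest-neighbour attraction.

    Uses the Ising-chain magnetization with field h = (beta_mu + beta_eps) / 2
    and coupling K = beta_eps / 4, so that exp(-4K) = exp(-beta_eps).
    """
    h = 0.5 * (beta_mu + beta_eps)
    m = math.sinh(h) / math.sqrt(math.sinh(h) ** 2 + math.exp(-beta_eps))
    return 0.5 * (1.0 + m)


def hill_slope_at_half_coverage(beta_eps, step=1e-4):
    """Central-difference slope of ln(theta / (1 - theta)) versus beta_mu at theta = 1/2."""
    midpoint = -beta_eps  # coverage is exactly 1/2 where the effective field vanishes

    def logit(beta_mu):
        theta = chain_coverage(beta_mu, beta_eps)
        return math.log(theta / (1.0 - theta))

    return (logit(midpoint + step) - logit(midpoint - step)) / (2.0 * step)


for beta_eps in (0.0, 2.0, 4.0):
    print(beta_eps, round(hill_slope_at_half_coverage(beta_eps), 2))
```

Without interaction the model reduces to a Langmuir isotherm with unit slope, and the slope grows as exp(beta_eps / 2) as the attraction increases. A strictly one-dimensional model with short-range interactions steepens the isotherm but never produces a true discontinuity, so it should be read only as a qualitative picture of cooperativity.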

Infrared spectroscopy measurements performed on mmen-Mg2(dobpdc)
fully support the proposed mechanism. As shown in
Fig. 2b, changes to the infrared spectrum were apparent when a sample
of the compound was cooled isobarically from 150 °C to 30 °C at
1 °C min−1 under flowing 5% CO2 in a N2 atmosphere. At high temperatures,
two distinct N-H vibrations arose at 3,258 and 3,334 cm−1,
which were also present in the spectrum of mmen-Mg2(dobpdc) in the
absence of CO2 and could be attributed to the coordinated and uncoordinated
ends of mmen, respectively. On cooling, both of these N-H
resonances disappeared, indicating changes to both amines of mmen,
while a new, extremely broad N-H band characteristic of ammonium
formation appeared. The weak but clearly discernible C=O vibration
at 1,690 cm−1 indicates that carbamate formation between mmen and CO2
occurred under all conditions, even at high temperatures. However, an
additional sharp band at 1,334 cm−1, corresponding to the C-N vibrational
mode of a carbamate, was observed only on cooling below 110 °C.
The delayed onset of this easily recognizable band, which is diagnostic
of a phase-change adsorbent of the type investigated here, is attributable
to changes in the resonance configuration of carbamate that occur on
coordination of one of its O atoms. The normalized intensities of the
C-N band and a second band at 658 cm−1 versus temperature demonstrate
that their formation was directly related to the sharp step in the
gravimetric adsorption isobar measured under identical experimental
conditions. From the infrared spectra it is clear that although a small
amount of CO2 can be adsorbed by means of ammonium carbamate
formation between pairs of adjacent amines, it is specifically the adsorption
of CO2 to form ammonium carbamate chains that endows these
materials with their step-change adsorption properties (see Extended
Data Fig. 2 for additional infrared spectra).

Figure 3 | Powder X-ray diffraction structures of mmen-Mn2(dobpdc).
a, b, Space-filling models of the solid-state structures of mmen-Mn2(dobpdc)
(a) and CO2-mmen-Mn2(dobpdc) (b) at 100 K. c, d, Portions of the
crystal structures for mmen-Mn2(dobpdc) before (c) and after (d) CO2
adsorption, as determined from powder X-ray diffraction data. e, A portion of the
crystal structure for the final configuration of CO2 adsorbed within
mmen-Mn2(dobpdc), depicting the formation of an ammonium carbamate chain along
the pore surface. Green, grey, red, blue and white spheres represent Mn, C, O, N
and H atoms, respectively; some H atoms are omitted for clarity.

To better understand the stepwise pathway by which the amines initially
adsorb CO2, density functional theory (DFT) calculations were
paired with in situ near-edge X-ray absorption fine structure (NEXAFS)
measurements of the nitrogen K-edge of mmen-Mg2(dobpdc) collected
under increasing CO2 pressure (Fig. 2c and Extended Data Fig. 3), and
all observed spectral changes were accurately reproduced by computed
spectra. From the NEXAFS spectra, the new pre-edge peak at 402.3 eV
arose solely from the carbamate nitrogen and was a clear signature of
carbamate insertion into the metal-nitrogen bond. As in infrared spectroscopy,
this feature is attributable to resonance of the nitrogen lone
pair into the π system of the carbamate after the breaking of the coordinate
bond using the same electron pair with the Mg metal centre. A
second new, broad feature between 411 and 419 eV also arose solely
from the carbamate nitrogen and was a signature of the new N-C bond
formed on adsorption of CO2. This feature appeared before insertion
and was general to both terminal-bound and inserted carbamate moieties.
Finally, the ~1 eV blueshift of the main edge peak at 405.4 eV was
characteristic of ammonium formation.

Figure 4 | A cooperative insertion mechanism for CO2 adsorption. Depiction
of the mechanism for CO2 adsorption at four neighbouring M-mmen sites
within an infinite one-dimensional chain of such sites running along the
crystallographic c axis of a mmen-M2(dobpdc) compound. Simultaneous
proton transfer and nucleophilic attack of N on a CO2 molecule forms
an ammonium carbamate species that destabilizes the amine coordinated
at the next metal site, initiating the cooperative adsorption of CO2 by a
chain reaction.

Solid-state NMR spectra indicated that CO2 adsorption affected the
manner in which diamines were coordinated to the metal sites of the
framework (Fig. 2d). On exposure of mmen-Mg2(dobpdc) to CO2, 15N
chemical shifts consistent with ammonium and carbamate were observed
at 31 and 72 p.p.m., respectively. Yet only a single 15N resonance was
apparent for mmen-Mg2(dobpdc) in the absence of CO2. This indicates
that the coordinated and uncoordinated ends of the mmen molecules
were capable of interconverting on the timescale of the NMR experiment,
although, as discussed above, they were distinguishable on the
much faster timescale of infrared spectroscopy. Despite being labile, the
amines were stable to evacuation under vacuum at high temperatures.
This unexpected lability seems to allow substitution, but not elimination,
reactions to occur rapidly under conditions relevant to carbon
capture. Furthermore, the sudden adsorption of CO2 in this compound
is thus associated with a transition from a dynamic surface state to a
well-ordered extended surface structure. Accordingly, the reaction with
CO2 can be considered to be thermodynamically non-spontaneous at
low pressures because of the large decrease in entropy associated with
this transition. Indeed, the molar entropy of gas-phase CO2 was found
to be the primary determinant of the step pressure for phase-change
adsorbents. As shown in Extended Data Fig. 4, step pressures for all five
phase-change metal-organic frameworks were linearly correlated with
the gas-phase entropy of CO2 as a function of temperature.
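The link between entropy and step pressure can be rationalized with a standard equilibrium argument. The relation below is a sketch, not a result taken from this work: it assumes that the framework-bound phases before and after insertion have unit activity, that CO2 behaves as an ideal gas, and that the enthalpy and entropy of the insertion reaction, written here as ΔH° and ΔS° (both negative), vary little with temperature. P° denotes a reference pressure.

```latex
\Delta G \;=\; \Delta H^{\circ} - T\,\Delta S^{\circ} - RT\,\ln\frac{P_{\mathrm{CO_2}}}{P^{\circ}},
\qquad
\Delta G = 0
\;\;\Longrightarrow\;\;
\ln\frac{P_{\mathrm{step}}}{P^{\circ}} \;=\; \frac{\Delta H^{\circ}}{RT} \;-\; \frac{\Delta S^{\circ}}{R}
```

Under these assumptions the reaction is non-spontaneous (ΔG > 0) below the step pressure, and because ΔH° is negative the step pressure rises as the temperature increases, in line with the behaviour described above.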


Understanding and manipulating the isotherm steps 


The mechanism of CO2 adsorption suggests that variation of the metal-amine
bond strength should provide a method of manipulating the
isotherm step position. The series of isostructural compounds
mmen-M2(dobpdc) (M = Mg, Mn, Fe, Co, Ni, Zn) were therefore synthesized,
and the CO2 adsorption isotherms for each were measured at 25, 40, 50
and 75 °C (Fig. 5). With the exception of the Ni compound, which
showed normal Langmuir-type adsorption behaviour (Fig. 5e), all of
the materials showed sharp isotherm steps that shifted to higher pressure
with increasing temperature. Analysis of the isotherm steps at 25 °C
yielded Hill coefficients of 10.6, 5.6, 7.5, 11.5 and 6.0 for M = Mg, Mn,
Fe, Co and Zn, respectively, reflecting the cooperative nature of the CO2
adsorption mechanism. Simulated isotherms generated from grand-canonical
Monte Carlo simulations using a simple lattice model captured
the experimentally observed isotherm step only when all mmen
groups reacted with CO2 and aligned down the crystallographic c axis
(see Extended Data Fig. 5).
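A Hill coefficient of the kind quoted above is commonly extracted as the slope of ln(θ/(1−θ)) against ln P across the step, where θ is the fractional loading. The sketch below shows that procedure on synthetic, noise-free data; it is an assumption about the general method, not a description of the fitting actually performed in this work. The helper names, the pressure grid, the half-saturation pressure and the 5 to 95% fitting window are all hypothetical; only the value 10.6 used to generate the test data is taken from the text.

```python
import math


def hill_loading(p, p_half, n_hill, q_max=1.0):
    """Hill-type isotherm: loading at pressure p for a step centred at p_half."""
    x = (p / p_half) ** n_hill
    return q_max * x / (1.0 + x)


def estimate_hill_coefficient(pressures, loadings, q_max):
    """Least-squares slope of ln(theta / (1 - theta)) versus ln(P) within the step."""
    xs, ys = [], []
    for p, q in zip(pressures, loadings):
        theta = q / q_max
        if 0.05 < theta < 0.95:  # keep only points on the step itself
            xs.append(math.log(p))
            ys.append(math.log(theta / (1.0 - theta)))
    if len(xs) < 2:
        raise ValueError("need at least two points inside the step region")
    x_mean = sum(xs) / len(xs)
    y_mean = sum(ys) / len(ys)
    sxy = sum((x - x_mean) * (y - y_mean) for x, y in zip(xs, ys))
    sxx = sum((x - x_mean) ** 2 for x in xs)
    return sxy / sxx


# Synthetic isotherm: logarithmically spaced pressures around a step at p_half = 1.0.
pressures = [math.exp(-1.0 + 0.02 * i) for i in range(101)]
loadings = [hill_loading(p, p_half=1.0, n_hill=10.6) for p in pressures]
n_estimated = estimate_hill_coefficient(pressures, loadings, q_max=1.0)
print(f"estimated Hill coefficient: {n_estimated:.2f}")
```

On real data the estimate depends on the chosen saturation loading and on how finely the step is sampled, so the fitting window deserves as much attention as the fit itself.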

For a given temperature, the step position varies in the order Mg <
Mn < Fe < Zn < Co, in good agreement with the published series for
octahedral metal complex stabilities. The lack of a step for the Ni compound,
even at very high pressures (Extended Data Fig. 6), is attributable
to the exceptional stability of the Ni-mmen bond, which prevents
carbamate insertion from taking place under the conditions surveyed.
Geometry optimizations performed with periodic DFT calculations
using various functionals were in good agreement with experimentally
calculated values, and the trend in the calculated adsorption energies
was directly correlated with the calculated metal-amine bond length
(Extended Data Fig. 5). Thus, we predict that similar variations in tuning
step position will be possible for the M2(dobpdc) series by altering the
sterics of the amine bound to the metal, as well as the spacer between
the two amine groups. Hence, depending on the concentration of CO2
present in a gas mixture, an adsorbent can be rationally designed to match
the optimum process conditions depicted in Fig. 1.

Figure 5 | CO2 adsorption isotherms. Carbon dioxide adsorption isotherms
at 25 °C (blue), 40 °C (blue-violet), 50 °C (red-violet) and 75 °C (red) for
mmen-Mg2(dobpdc) (a), mmen-Mn2(dobpdc) (b), mmen-Fe2(dobpdc) (c),
mmen-Co2(dobpdc) (d), mmen-Ni2(dobpdc) (e) and mmen-Zn2(dobpdc) (f). Despite
the use of aliphatic amine groups as the CO2 reactive species, the metal-organic
framework has an essential role in determining isotherm shape, owing to the
importance of metal-ligand reorganization reactions in the mechanism.

Although stepped adsorption isotherms have been observed previously
in solid adsorbents, the origin of the step reported here is unique
and distinct from all previously reported mechanisms. First, in contrast
to most metal-organic frameworks showing such behaviour, the isotherm
steps reported here are not attributable to pore-opening, gate-opening
or pore-closing processes. For mmen-Mn2(dobpdc), only
a ~1% decrease in the unit cell volume was observed on CO2 adsorption,
and from Fig. 3a, b it is apparent that the entire pore surface was
accessible to CO2 throughout the adsorption isotherm. A gate-opening
mechanism attributable to the rearrangement of flexible hydrogen-bonding
functional groups, which function by preventing CO2 diffusion
into the pores at low partial pressures, cannot explain the presence
of distinct adsorption steps when the material is slowly cooled from high
to low temperatures under isobaric adsorption conditions (see below).
Second, in contrast to adsorbed-layer phase transitions on highly homogeneous
surfaces, the adsorbed phase reported here was stable at temperatures
well above the critical temperature of CO2 (ref. 31). Third, the
phase transition was a metal cation-dependent, solid-to-solid transformation,
in contrast to liquid-to-liquid or liquid-to-solid phase change
reactions typically reported for amine-CO2 systems. Last, under conditions
relevant to CO2 capture, desorption hysteresis was minimal,
because the sharp steps occurred over a narrow pressure regime and
the adsorption and desorption onset points were at about the same temperature
and pressure (see Extended Data Fig. 7).

Several features unique to the mmen-M2(dobpdc) series permitted
phase transitions of this type to be observed. First, for solid ammonium
carbamate chains to form, the metal-amine coordinate bond must be
capable of rearrangement. Thus, only amines tethered to the solid surface
through coordinate bonds rather than covalent bonds can undergo
the mechanism reported here. Second, a homogeneous surface
with appropriately positioned adsorption sites, which is dictated by the
location of open metal sites within the pores of the metal-organic framework,
is necessary. Thus, a very limited number of metal-organic framework
materials would be able to mimic the adsorption behaviour
reported here, and it is likely that no amine-functionalized mesoporous
silica sorbent could be engineered precisely enough to meet these requirements.
Notably, in contrast to the pore expanded derivatives of M2(dobdc)
reported here, amine functionalization of the parent Mg2(dobdc) compound
was not reported to result in stepped adsorption isotherms.


Low-energy carbon capture applications 


Effective adsorbents for carbon capture must possess large working capacities
for processes occurring at temperatures above 40 °C and at CO2
partial pressures near 0.15 bar for coal flue gas or near 0.05 bar for a
natural gas flue stream. On this basis, the location of the isotherm steps
for the Mg and Mn compounds makes them better suited for this application
than the Fe, Co or Zn compounds, which are better suited for
separations from gas mixtures with higher CO2 concentrations. To assess
the utility of these phase-change adsorbents for capturing CO2 in a pure
temperature swing adsorption process, adsorption isobars were collected
under dynamic gas flow. Samples of mmen-Mg2(dobpdc) and mmen-Mn2(dobpdc)
were activated, saturated with 100% CO2 and then cooled
isobarically to room temperature under three different CO2-containing
gas mixtures: 100%, 15% and 5%. The resulting isobars, shown in
Fig. 6a, b, reveal how small changes in temperature induced large
changes in the quantity of CO2 adsorbed. As shown in Fig. 6c, d, phase-change
adsorbents showed very large working capacities when used in
temperature swing adsorption processes. For mmen-Mg2(dobpdc) to
give a working capacity in excess of 13 wt%, the material must simply
swing between 100 and 150 °C. Similarly, the working capacity of mmen-Mn2(dobpdc)
was in excess of 10 wt% when cycled between 70 and
120 °C. In particular, to simulate a pure temperature swing adsorption
process accurately, 15% CO2 in N2 was flowed over the samples during
the cooling phase, whereas 100% CO2 was used during heating phases.
In contrast to experiments that use a purge gas to assist CO2 desorption,
no inert gases were used to regenerate the samples.

Figure 6 | Isobaric CO2 adsorption and cycling experiments. a, b, Variable-temperature
adsorption isobars of 100% (red), 15% (green) and 5% (blue)
CO2 (in N2) for mmen-Mg2(dobpdc) (a) and mmen-Mn2(dobpdc)
(b), showing that under dynamic conditions the sharp transition region
allows phase-change adsorbents to achieve very large working capacities under
a wide range of adsorption conditions. For each material, the phase-transition
temperature is dependent on the pressure of CO2 in the gas mixture,
with higher phase-transition temperatures being observed at higher CO2
partial pressures. c, d, Cycling data for a pure temperature swing process
involving adsorption from a simulated coal flue gas (15% CO2 in N2)
at 100 °C in mmen-Mg2(dobpdc) (c) and 70 °C in mmen-Mn2(dobpdc)
(d), followed by desorption at 150 and 120 °C, respectively, using a flow of
100% CO2. Respective working capacities of 13% and 10% are attained,
with no loss in capacity over the course of ten cycles.

Estimations based on differential scanning calorimetry, thermogravimetric
analysis and isosteric heat determinations indicate that
mmen-Mg2(dobpdc) and mmen-Mn2(dobpdc) can achieve regeneration energies
of about 2.2-2.3 MJ per kg of CO2 captured. This value is appreciably
lower than the regeneration energies attainable using monoethanolamine
(3.5 MJ kg−1) or even state-of-the-art amines, such as piperazine
and KS-1 (2.6 MJ kg−1). In contrast to aqueous amine absorbents that
use heat exchangers to save sensible energy costs, the greater working
capacities and smaller temperature swings of phase-change adsorbents
allow more economical processes to be developed for a high-enthalpy
adsorbent without the use of a heat exchanger. Because phase-change
adsorbents saturate with CO2 at their transition point, it is not necessary
for adsorption to occur at the lowest possible temperature. Whereas
we previously showed that mmen-Mg2(dobpdc) can operate effectively
under standard flue gas adsorption conditions (40 °C), Fig. 6 shows
that phase-change adsorbents operated more efficiently at higher adsorption
temperatures than at lower temperatures. Because classical adsorbents
must operate at the lowest possible adsorption temperature to
maximize working capacity, only phase-change adsorbents can enable
high-temperature adsorption processes to be considered.
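The way working capacity and temperature swing enter a regeneration energy can be seen in a simple two-term estimate: the sensible heat needed to warm the adsorbent, divided by the CO2 it releases, plus the heat of desorption per unit mass of CO2. The sketch below is a back-of-the-envelope illustration and not the procedure used to obtain the values quoted above. The 50 °C swing and 13 wt% working capacity are taken from the text, with wt% assumed to mean mass of CO2 per mass of adsorbent; the heat capacity and heat of desorption are hypothetical placeholders.

```python
M_CO2_KG_PER_MOL = 44.01e-3  # molar mass of CO2


def regeneration_energy_mj_per_kg(delta_t_k, working_capacity_wt_pct,
                                  heat_of_desorption_kj_per_mol,
                                  cp_sorbent_kj_per_kg_k):
    """Two-term estimate of regeneration energy in MJ per kg of CO2 released."""
    # Working capacity as kg CO2 per kg adsorbent (assumes wt% is relative to adsorbent mass).
    capacity = working_capacity_wt_pct / 100.0
    # Sensible heat to warm the adsorbent, shared over the CO2 it releases (kJ per kg CO2).
    sensible = cp_sorbent_kj_per_kg_k * delta_t_k / capacity
    # Heat needed to reverse the adsorption reaction (kJ per kg CO2).
    desorption = heat_of_desorption_kj_per_mol / M_CO2_KG_PER_MOL
    return (sensible + desorption) / 1000.0


# Swing and capacity from the text; 70 kJ/mol and 1.5 kJ/(kg K) are hypothetical inputs.
estimate = regeneration_energy_mj_per_kg(
    delta_t_k=50.0,
    working_capacity_wt_pct=13.0,
    heat_of_desorption_kj_per_mol=70.0,
    cp_sorbent_kj_per_kg_k=1.5,
)
print(f"illustrative regeneration energy: {estimate:.2f} MJ per kg CO2")
```

Because the sensible term scales as the temperature swing divided by the working capacity, a material that delivers a large capacity over a small swing keeps that term small even when its heat of desorption is high. The printed number depends entirely on the placeholder inputs and should not be read as reproducing the reported estimate.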

Adsorbing CO2 at elevated temperatures affords several additional
process benefits besides directly decreasing sorbent regeneration energy.
In particular, overcoming the competitive adsorption of water vapour,
which is present in flue gas at high concentrations, presents a serious
challenge for solid adsorbents. Amine-based solid adsorbents fare better
than those using a purely physical adsorption mechanism, because
they are known to retain their affinity for CO2 under humid conditions,
as also shown here for mmen-Mg2(dobpdc) (see Extended Data
Fig. 7 for additional dynamic gas adsorption experiments). However,
even for systems where the amine reactivity with CO2 is unaffected by
the presence of water, the physical adsorption of water on non-amine
binding sites increases the overall regeneration energy of the material.
As shown in Extended Data Fig. 6b, mmen-Mg2(dobpdc) adsorbed
nearly 90% less water at 100 °C than at 40 °C. Thus, the energy penalty
associated with desorbing co-adsorbed water can be substantially decreased
by performing CO2 adsorption at a high temperature, obviating
the need for strict flue gas dehumidification. No changes to the CO2
adsorption isotherm were apparent after exposure to water at 40 or
100 °C, indicating the stability of the mmen-Mg2(dobpdc) in the presence
of water vapour even at high temperatures.

The high effective operating temperatures of mmen-Mg2(dobpdc)
and mmen-Mn2(dobpdc) offer opportunities for cost savings beyond
just decreases in the regeneration energy. Because of the exothermic
nature of all adsorption processes, the incorporation of labour- and
material-intensive coolant pipes into an adsorbent bed (a component of
the considerable infrastructure cost for carbon capture) is necessary to
maintain isothermal adsorption conditions. The rate of heat transfer
from a sorbent bed to the coolant pipes, which contain surface temperature
water at ~25 °C, is primarily dependent on the heat transfer
coefficient of the sorbent, the total contact area between the sorbent
and the coolant pipes, and the temperature differential between the sorbent
and the coolant. The physical size of adsorption units is dictated,
to a great extent, by the need to provide sufficient contact area between
the coolant and sorbent for effective heat removal. For processes that
are limited by heat transfer rather than mass transfer, which is likely
for many CO2 capture processes using solid adsorbents, the use of high
temperatures will maximize the temperature differential between the
coolant and the sorbent, substantially reducing the overall bed size by
reducing the size of the necessary contact area. By increasing the
coolant-sorbent temperature differential from about 15 °C to nearly
75 °C, adsorption bed size could potentially be reduced fivefold. In turn,
smaller adsorbent beds would reduce the pressure drop across the adsorbent,
reduce the size and cost of the required capital equipment, and
allow as little as one-fifth as much adsorbent to be used. By decreasing
these other system costs, new classes of adsorbents have the ability to
reduce the cost of carbon capture substantially beyond simply decreasing
the sorbent regeneration energy.
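The fivefold figure follows from the usual steady-state heat transfer relation, in which the heat removed equals the heat transfer coefficient times the contact area times the temperature differential. The sketch below works through that arithmetic. The 25 °C coolant temperature comes from the text, and the 40 °C and 100 °C adsorption temperatures are those discussed earlier; the heat duty and heat transfer coefficient are hypothetical placeholders that cancel in the ratio.

```python
def contact_area_m2(heat_duty_w, heat_transfer_coeff_w_m2_k, t_sorbent_c, t_coolant_c):
    """Contact area needed to remove a given heat duty, from Q = U * A * dT."""
    delta_t = t_sorbent_c - t_coolant_c
    if delta_t <= 0:
        raise ValueError("sorbent must be hotter than the coolant")
    return heat_duty_w / (heat_transfer_coeff_w_m2_k * delta_t)


HEAT_DUTY_W = 1.0e6   # hypothetical heat released by adsorption
U_W_M2_K = 50.0       # hypothetical overall heat transfer coefficient
T_COOLANT_C = 25.0    # coolant water temperature given in the text

area_at_40c = contact_area_m2(HEAT_DUTY_W, U_W_M2_K, 40.0, T_COOLANT_C)    # dT = 15 C
area_at_100c = contact_area_m2(HEAT_DUTY_W, U_W_M2_K, 100.0, T_COOLANT_C)  # dT = 75 C
print(f"area ratio (40 C bed / 100 C bed): {area_at_40c / area_at_100c:.1f}")
```

The ratio depends only on the two temperature differentials, which is why the conclusion holds regardless of the particular heat duty or heat transfer coefficient, provided the process is limited by heat transfer as the text assumes.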


A functional model for Rubisco 


The reactivity trends of the M2(dobpdc) series may help to clarify the
evolutionary conservation of Mg2+ within the active site of most photosynthetic
enzymes. Biological fixation of atmospheric CO2 is effected
primarily by the ribulose-1,5-bisphosphate carboxylase/oxygenase
(Rubisco) enzyme. Striking structural similarities exist between
mmen-Mg2(dobpdc) and the enzymatic pocket of Rubisco, which in its active
form also contains an octahedral Mg2+ ion ligated by five oxygen donor
ligands and a reactive aliphatic amine ligand that adsorbs gas-phase
CO2 to form an O-bound carbamate ligand (Extended Data Fig. 1f).
Although other divalent metal ions can be incorporated into either structure,
in each case the presence of Mg2+ greatly enhances the reactivity
for CO2 fixation at very low CO2 concentrations. Although further
study is necessary, the trends that we observed suggest that the inclusion
of Mg2+ within the active site of Rubisco may be necessary to endow
the lysine residue that forms the enzymatically competent carbamate-ligated
metal with sufficient reactivity at low partial pressures of CO2.


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS


General synthesis and characterization methods. All reagents and solvents were
obtained from commercial sources at reagent-grade purity or higher. The compound
H4(dobpdc) was synthesized as reported previously. No statistical methods were
used to predetermine sample size.

Laboratory powder X-ray diffraction patterns were collected on a Bruker AXS
D8 Advance diffractometer equipped with Cu Kα radiation (λ = 1.5418 Å), a Göbel
mirror, a Lynxeye linear position-sensitive detector, and mounting the following
optics: fixed divergence slit (0.6 mm), receiving slit (3 mm) and secondary-beam
Soller slits (2.5°). The generator was set at 40 kV and 40 mA. Owing to the oxygen
sensitivity of Fe2(dobpdc) and mmen-Fe2(dobpdc), X-ray diffraction patterns were
collected in sealed glass capillaries placed on the powder stage. Thermogravimetric
analysis was carried out at a ramp rate of 2 °C min−1 in a nitrogen flow with a TA
Instruments Q5000. Elemental analyses for C, H and N were performed at the
Microanalytical Laboratory of the University of California, Berkeley.

Synthesis of Mg2(dobpdc). To a 20 ml glass scintillation vial, H4dobpdc (27.4 mg,
0.10 mmol), Mg(NO3)2·6H2O (64.0 mg, 0.25 mmol) and 10 ml of mixed solvent
(55:45 methanol/dimethylformamide (DMF)) were added. The vial was sealed with
a polytetrafluoroethylene (PTFE)-lined cap and placed in a well plate 2 cm deep on a
393 K hot plate. After 12 h a white powder formed on the bottom and walls of the
vial. The reaction mixture was then decanted and the remaining powder was soaked
three times in DMF and then three times in methanol. The solid was then collected
by filtration and fully desolvated by heating under dynamic vacuum (<10 μbar) at
523 K for 24 h to afford 23.3 mg (0.073 mmol), 73% of Mg2(dobpdc). Combustion
elemental analysis calculated (Anal. Calcd) for C14H6O6Mg2: C, 52.74; H, 1.90.
Found: C, 52.47; H, 1.64.
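
The calculated elemental analysis and the percentage yield quoted above can be checked with a few lines of arithmetic. The following is a minimal sketch, not part of the reported procedure; the rounded atomic weights and the helper names are assumptions introduced for illustration, and the ligand H4dobpdc (0.10 mmol) is taken as the limiting reagent.

```python
# Rounded standard atomic weights in g/mol (assumed values for this sketch).
ATOMIC_WEIGHT = {"C": 12.011, "H": 1.008, "O": 15.999, "Mg": 24.305}


def molar_mass(formula):
    """Molar mass (g/mol) of a composition given as {element: count}."""
    return sum(ATOMIC_WEIGHT[el] * n for el, n in formula.items())


def mass_percent(formula):
    """Mass percentage of each element in the composition."""
    total = molar_mass(formula)
    return {el: 100.0 * ATOMIC_WEIGHT[el] * n / total for el, n in formula.items()}


# Desolvated Mg2(dobpdc), C14H6O6Mg2.
mg2_dobpdc = {"C": 14, "H": 6, "O": 6, "Mg": 2}
calcd = mass_percent(mg2_dobpdc)

# Isolated mass in mg divided by molar mass in g/mol gives mmol.
isolated_mmol = 23.3 / molar_mass(mg2_dobpdc)
# One H4dobpdc gives one Mg2(dobpdc), so 0.10 mmol ligand sets the theoretical yield.
yield_percent = 100.0 * isolated_mmol / 0.10

print(f"Anal. Calcd: C, {calcd['C']:.2f}; H, {calcd['H']:.2f}")
print(f"Isolated: {isolated_mmol:.3f} mmol, yield {yield_percent:.0f}%")
```

The same two helpers apply to the other frameworks in the series once the corresponding metal is added to the atomic-weight table.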

Synthesis of Mn2(dobpdc). To a 20 ml glass scintillation vial, H4dobpdc (27.4 mg,
0.10 mmol), MnCl2·4H2O (49.5 mg, 0.25 mmol) and 10 ml of mixed solvent (1:1
ethanol/DMF) were added. The vial was sealed with a PTFE-lined cap and placed
in a well plate 2 cm deep on a 393 K hot plate. After 12 h a yellow powder formed on
the bottom and walls of the vial. The reaction mixture was then decanted and the
remaining powder was soaked three times in DMF and then three times in methanol.
The solid was then collected by filtration and fully desolvated by heating under
dynamic vacuum (<10 μbar) at 523 K for 24 h to afford 33.8 mg (0.0889 mmol),
89% of Mn2(dobpdc). Anal. Calcd for C14H6O6Mn2: C, 44.24; H, 1.59. Found: C,
44.32; H, 1.23.

Synthesis of Fe2(dobpdc). Anhydrous FeCl2 (2.85 g, 22.4 mmol), H4dobpdc (1.85 g,
6.75 mmol), anhydrous DMF (400 ml) and anhydrous methanol (50 ml) were
added to a 500 ml Schlenk flask under an argon atmosphere. The reaction mixture
was heated to 393 K and stirred for 24 h to afford a dark yellow-green precipitate.
The solvent was then removed by cannula transfer and replaced with fresh anhydrous
DMF. The reaction mixture was soaked three times in DMF and then three
times in methanol. The solid was then fully desolvated by heating under dynamic
vacuum (<10 μbar) at 523 K for 24 h to afford 2.395 g (6.28 mmol), 93% of
Fe2(dobpdc). Anal. Calcd for C14H6O6Fe2: C, 44.03; H, 1.58. Found: C, 43.72; H,
1.48.

Synthesis of Co2(dobpdc). To a 20 ml glass scintillation vial, H4dobpdc (41.1 mg,
0.15 mmol), Co(NO3)2·6H2O (109 mg, 0.375 mmol) and 15 ml of mixed solvent
(1:1:1 water/DMF/ethanol) were added. The vial was sealed with a PTFE-lined cap
and placed in a well plate 2 cm deep on a 393 K hot plate. After 36 h a pink powder
formed on the bottom of the vial. The reaction mixture was then decanted and the
remaining powder was soaked three times in DMF and then three times in methanol.
The solid was then collected by filtration and fully desolvated by heating under
dynamic vacuum (<10 μbar) at 523 K for 24 h to afford 54.1 mg (0.139 mmol), 93%
of Co2(dobpdc). Anal. Calcd for C14H6O6Co2: C, 43.33; H, 1.56. Found: C, 42.92;
H, 1.38.

Synthesis of Zn2(dobpdc). To a 20 ml glass scintillation vial, H4dobpdc (27.4 mg,
0.10 mmol), ZnBr2·2H2O (83.5 mg, 0.32 mmol) and 10 ml of mixed solvent (1:1
ethanol/DMF) were added. The vial was sealed with a PTFE-lined cap and placed
in a well plate 2 cm deep on a 393 K hot plate. After 12 h a pale yellow powder
formed on the bottom and walls of the vial. The reaction mixture was then decanted
and the remaining powder was soaked three times in DMF and then three
times in methanol. The solid was then collected by filtration and fully desolvated by
heating under dynamic vacuum (<10 μbar) at 523 K for 24 h to afford 21.4 mg
(0.0534 mmol), 53% of Zn2(dobpdc). Anal. Calcd for C14H6O6Zn2: C, 41.94; H,
1.51. Found: C, 41.26; H, 1.57.

Synthesis of Ni2(dobpdc). To a 20 ml glass scintillation vial, H4dobpdc (41.1 mg,
0.150 mmol), Ni(NO3)2·6H2O (109 mg, 0.375 mmol) and 15 ml of mixed solvent
(1:1:1 water/DMF/ethanol) were added. The vial was sealed with a PTFE-lined cap
and placed in a well plate 2 cm deep on a 393 K hot plate. After 36 h, a green powder
formed on the bottom of the vial. The reaction mixture was then decanted and the
remaining powder was soaked three times in DMF and then three times in methanol.
The solid was then collected by filtration and fully desolvated by heating under
dynamic vacuum (<10 μbar) at 523 K for 24 h to afford 39.3 mg (0.101 mmol),
68% of Ni2(dobpdc). Anal. Calcd for C14H6O6Ni2: C, 43.39; H, 1.56. Found: C,
43.09; H, 1.24.

General synthesis of mmen-M2(dobpdc). In a plastic glovebag with positive N2
pressure, 10 ml of 10% mmen solution in hexanes was added to ~100 mg of activated
M2(dobpdc) in a glass Micromeritics adsorption tube. The tube was sealed
with a rubber septum and left to sit undisturbed for 4 h in the glovebag. In the glovebag,
the sample was collected by vacuum filtration and rinsed with five 10 ml portions
of dry hexanes. The hexanes-solvated sample was desolvated under dynamic
vacuum (<10 μbar) at 348 K (for Zn) or 373 K (for Mg, Mn, Fe, Co and Ni) for 4 h.

Elemental analysis of mmen-M2(dobpdc) series. Mg: Anal. Calcd for
C22H30N4O6Mg2: C, 53.37; H, 6.11; N, 11.32. Found: C, 52.39; H, 5.52; N, 10.36.
Mn: Anal. Calcd for C22H30N4O6Mn2: C, 47.47; H, 5.43; N, 10.07. Found: C, 47.26;
H, 5.20; N, 10.24. Fe: Anal. Calcd for C22H30N4O6Fe2: C, 47.34; H, 5.42; N, 10.04.
Found: C, 47.3; H, 4.95; N, 9.71. Co: Anal. Calcd for C22H30N4O6Co2: C, 46.82;
H, 5.36; N, 9.93. Found: C, 46.27; H, 4.94; N, 9.61. Zn: Anal. Calcd for
C22H30N4O6Zn2: C, 45.78; H, 5.24; N, 9.71. Found: C, 45.46; H, 4.75; N, 9.78.
Ni: Anal. Calcd for C22H30N4O6Ni2: C, 46.86; H, 5.36; N, 9.94. Found: C, 46.68;
H, 5.33; N, 10.28.

Low-pressure gas adsorption measurements. For all low-pressure (0-1.1 bar) gas
adsorption measurements, 60-130 mg of adsorbent was transferred to a pre-weighed
glass sample tube under an atmosphere of nitrogen gas and capped with a Transeal.
Samples were then transferred manually to a Micromeritics ASAP 2020 gas adsorption
analyser and heated to the activation temperatures previously specified, under
vacuum. The sample was considered activated when the outgas rate was less than
2 μbar min⁻¹. The evacuated tube containing the activated sample was then transferred
to a balance and weighed to determine the mass of the desolvated sample.
The tube was then placed manually on the analysis port of the aforementioned
instrument, where the outgas rate was once again confirmed to be less than 2 μbar min⁻¹.
Isothermal conditions were maintained at 77 K with liquid N2, at 25, 40, 50 and
75 °C with a Julabo F32 water circulator, and at 100 °C with a heated sand bath
controlled by a programmable temperature controller.

High-pressure gas adsorption measurements. The high-pressure CO2 adsorption
isotherm for mmen-Ni2(dobpdc) was measured on a HPVA-II-100 from Particulate
Systems, a Micromeritics company. Here, 0.27 g of activated mmen-Ni2(dobpdc)
was loaded into a tared 2 ml stainless steel sample holder inside a glovebox under a N2
atmosphere. Before the sample holder was connected to the variable compression
ratio fittings of the high-pressure assembly inside the glovebox, the sample holder
was weighed to determine the sample mass.

Before mmen-Ni2(dobpdc) was measured, 25 °C CO2 background measurements
were performed on a sample holder containing nonporous glass beads that occupied
a volume similar to that of a typical sample. A small negative background was observed at
higher pressures, which may have been due to errors in volume calibrations,
temperature calibrations and/or the equation of state used to perform the nonideality
corrections. Nevertheless, the background CO2 adsorption was consistent across
several measurements and was well described by fitting to a polynomial equation.
This polynomial was then used to perform a background subtraction on the raw
high-pressure CO2 data for mmen-Ni2(dobpdc).
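
The background correction described above amounts to fitting a polynomial to the blank (glass-bead) isotherm and subtracting its value at each pressure from the raw sample isotherm. The sketch below illustrates that idea only; the polynomial degree, the pressure grid and all uptake values are hypothetical placeholders, since the source does not report them, and the small least-squares solver stands in for whatever fitting routine was actually used.

```python
def polyfit(xs, ys, degree):
    """Least-squares coefficients c[0] + c[1]*x + ... from the normal equations."""
    n = degree + 1
    # Augmented matrix [A^T A | A^T y] for the Vandermonde design matrix A.
    m = [
        [sum(x ** (i + j) for x in xs) for j in range(n)]
        + [sum(y * x ** i for x, y in zip(xs, ys))]
        for i in range(n)
    ]
    # Gauss-Jordan elimination with partial pivoting.
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(n):
            if r != col:
                f = m[r][col] / m[col][col]
                m[r] = [a - f * b for a, b in zip(m[r], m[col])]
    return [m[i][n] / m[i][i] for i in range(n)]


def polyval(coeffs, x):
    """Evaluate the fitted polynomial at x."""
    return sum(c * x ** k for k, c in enumerate(coeffs))


# Hypothetical data: pressures in bar, uptakes in arbitrary units.
pressures = [0.0, 10.0, 20.0, 30.0, 40.0, 50.0]
blank = [-2e-4 * p ** 2 - 1e-3 * p for p in pressures]    # small negative background
true_uptake = [3.0 * p / (1.0 + p) for p in pressures]    # placeholder sample signal
raw = [t + b for t, b in zip(true_uptake, blank)]         # what would be measured

coeffs = polyfit(pressures, blank, degree=2)              # degree is an assumption
corrected = [r - polyval(coeffs, p) for r, p in zip(raw, pressures)]
```

With real data the blank would be fitted over several repeat measurements, and the degree would be chosen as the lowest one that describes them adequately.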

In situ infrared spectroscopy. A powdered sample of mmen-Mg2(dobpdc) (~15 mg)
was pelletized, shaped into a self-supported wafer and placed inside a commercial
Fourier-transform infrared reactor cell (2000-A multimode; AABSPEC), which
allowed infrared spectra to be recorded under flow conditions at a wide range of
temperatures. Before CO2 was flowed across the sample, the sample was first activated
at 150 °C for 15 min while flowing 30 ml min⁻¹ of pure nitrogen (heating ramp
rate of 1 °C min⁻¹). Next, the gas flow was switched to 5% CO2 in N2 at a flow rate
of 30 ml min⁻¹, and the system was cooled from 150 °C to 30 °C at a rate of
1 °C min⁻¹. Before cooling, the sample was conditioned for 15 min at 150 °C with
the 5% CO2 in N2 gas mixture. After cooling to 30 °C, the sample was heated again
to 150 °C at a rate of 5 °C min⁻¹ under N2, to check the reversibility of the process.
Spectra were collected every 5 °C with a resolution of 2 cm⁻¹ (number of scans
equal to 32) on a Perkin-Elmer System 2000 infrared spectrophotometer equipped
with a HgCdTe detector. For Extended Data Fig. 2c, curve-fitting analysis was performed
with the Levenberg-Marquardt method by using the OPUS software (Bruker
Optik); 100% Gauss functions were used.

For Extended Data Fig. 2b, the attenuated total reflectance (ATR) accessory of a
Perkin-Elmer Spectrum 400 was enclosed within a plastic glovebag filled with positive
gas pressure. Spectra of activated adsorbent were collected first under a N2
atmosphere. After 32 scans at a resolution of 4 cm⁻¹, the glovebag was allowed to
fill with CO2 for 5 min and infrared spectra of the sample were collected again under
an atmosphere of CO2. For Extended Data Fig. 2e, a gas mixture of 20 mbar H2O,
150 mbar CO2 and 600 mbar N2 was dosed onto an activated sample of
mmen-Mg2(dobpdc) and was left to equilibrate for 6 h before being placed on the ATR
stage of a glovebag-encased Spectrum 400 instrument.

Solid-state NMR spectroscopy. Solid-state NMR experiments were performed on
a 7.05 T magnet with a Tecmag Discovery spectrometer, using a Doty 5 mm triple-resonance
magic angle spinning probe. The frequency of ¹⁵N was 30.4 MHz. ¹⁵N
chemical shifts were referenced to ¹⁵N-labelled glycine at 33 p.p.m. relative to liquid
ammonia. The experiments were performed at ambient temperature. Magic angle
spinning was used to collect high-resolution NMR spectra with a spinning rate ranging
from 5 to 6 kHz. The 90° pulses for ¹H, ¹³C and ¹⁵N were 3.8, 4.7 and 7 μs,
respectively. Ramped cross-polarization with variable contact times from 1 to 10 ms
was used to generate ¹⁵N signals. Recycle delays were set to be fivefold longer than
the T1 of protons. Two-pulse phase-modulated ¹H decoupling of 100 kHz was
applied during ¹⁵N signal acquisition.

Isobaric CO2 adsorption methods. Isobaric gravimetric adsorption experiments
were performed on a TA Instruments Q5000 analyser using premixed gas cylinders
(Praxair). A flow rate of 25 ml min⁻¹ was employed for all gases. Before each
experiment, the samples (~5 mg) were desolvated by heating under N2 for 1 h. To
simulate temperature swing adsorption processes accurately, samples were activated
and reactivated with only 100% CO2. Sample masses were normalized to be
0% under a 100% CO2 atmosphere ~10 s before the onset of each cooling cycle.
Thus, the base mass corresponds to the weight of the metal-organic framework and
any CO2 adsorbed under the activation conditions: 150 °C for mmen-Mg2(dobpdc)
and 120 °C for mmen-Mn2(dobpdc). Masses were uncorrected for buoyancy effects.

For Fig. 6a, b, ramp rates of 1 °C min⁻¹ were used; samples were reactivated at
the appropriate regeneration temperature under 100% CO2 for 15 min between gases.
Switching from 100% CO2 to a lower concentration of CO2 occurred at the onset of
cooling, ensuring that the lower-concentration gas would not simulate a purge gas
for regeneration.

For Fig. 6c, d, ramp rates of 10 °C min⁻¹ were used. Samples were heated between
the adsorption and regeneration conditions under 100% CO2; the regeneration time
was 5 min. Samples were cooled under low-concentration CO2; the adsorption time
was 5 min. Total cycle time was ~20 min.

High-resolution powder X-ray diffraction. Samples of fully activated
mmen-Mn2(dobpdc) microcrystalline powders (~10 mg) were loaded into 1.0 mm boron-rich
glass capillaries inside a glovebox under an N2 atmosphere. The capillaries were
attached to a gas cell, which was connected to the analysis port of a Micromeritics
ASAP-2020 gas adsorption instrument. The capillaries were fully evacuated at room
temperature for 30 min, dosed with 5 mbar of He (mmen-Mn2(dobpdc)), 5 mbar
of CO2 (100K-CO2-mmen-Mn2(dobpdc)) or 100 mbar of CO2
(295K-CO2-mmen-Mn2(dobpdc)) and then equilibrated at room temperature for 15 min, 8 h or 4 h,
respectively. After equilibration, the capillaries were flame-sealed and placed inside
a Kapton tube that was sealed on both ends with epoxy.

High-resolution synchrotron X-ray powder diffraction data were subsequently
collected at beamline 11-BM at the Advanced Photon Source (APS), Argonne National
Laboratory, with an average wavelength of ~0.4137 Å. Diffraction patterns
were collected at 100 K, 100 K and 295 K for mmen-Mn2(dobpdc),
100K-CO2-mmen-Mn2(dobpdc) and 295K-CO2-mmen-Mn2(dobpdc), respectively. Discrete
detectors covering an angular range from −6 to 16° 2θ were scanned over a 34° 2θ range,
with data points collected every 0.001° 2θ and a scan speed of 0.01° s⁻¹. Owing to the
large number of collected data points, all diffraction patterns were rebinned to a
step size of 0.005° 2θ before structure solution and Rietveld refinement. Additionally,
all diffraction patterns showed a high-intensity peak at ~1.25° that was ~500% more
intense than any other diffraction peak. Because this high d-spacing peak does not
contribute important structural information and was heavily biasing all structure
solution attempts, all data analysis was performed with a minimum 2θ of 2°.
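
As an illustration of the two preprocessing steps just described, the sketch below merges every five consecutive 0.001° points into one 0.005° bin and then discards bins below the 2° cutoff. It is a sketch under stated assumptions: the source does not say whether intensities were summed or averaged during rebinning (averaging is assumed here), and the function name and the synthetic pattern are hypothetical.

```python
def rebin(two_theta, intensity, factor=5, min_two_theta=2.0):
    """Average runs of `factor` points, then keep bins at or above the cutoff."""
    n_bins = len(two_theta) // factor  # a trailing partial bin is discarded
    rebinned = []
    for b in range(n_bins):
        lo, hi = b * factor, (b + 1) * factor
        centre = sum(two_theta[lo:hi]) / factor   # bin centre in degrees 2-theta
        mean_i = sum(intensity[lo:hi]) / factor   # averaged intensity (assumption)
        if centre >= min_two_theta:
            rebinned.append((centre, mean_i))
    return rebinned


# Synthetic pattern on a 0.001 degree grid, 1.990 to 2.019 degrees (30 points).
two_theta = [1.990 + 0.001 * i for i in range(30)]
intensity = [100.0 + i for i in range(30)]

bins = rebin(two_theta, intensity)
for centre, mean_i in bins:
    print(f"{centre:.3f}  {mean_i:.1f}")
```

If counts were summed instead, the estimated standard deviations passed to the refinement would need to be propagated accordingly.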

A standard peak search, followed by indexing through the Singular Value Decomposition
approach, as implemented in TOPAS-Academic, allowed the determination
of approximate unit-cell parameters. Tentatively, the space groups for both
mmen-Mn2(dobpdc) and CO2-mmen-Mn2(dobpdc) were assigned as P3221 because
the framework was expected to be isostructural to Zn2(dobpdc), which was
previously characterized by single crystal X-ray diffraction. Precise unit-cell dimensions
were determined by structureless Le Bail refinements (Supplementary Table 1).
Here, the background was modelled by a polynomial function of the Chebyshev
type, and anisotropic peak broadening was described using parameters appropriate
for a hexagonal crystal system. Successful structure solution and Rietveld refinement
confirmed that P3221 was indeed the correct space group for all compounds.

Breakthrough adsorption measurements. Into a glass U-tube with an interior
diameter of 4 mm, 203 mg of activated mmen-Mg2(dobpdc) powder was added.
The column was placed inside a furnace with a programmable temperature controller.
Gas flow rates of 10 ml min⁻¹ were used. Column effluent was analysed using
a Hy-Energy/Setaram RGAPro-2500 with continuous sampling capability. Argon
(99.999%; Praxair) was used as a purge gas for sample activation at 100 °C for
30 min. For measurements at 25 °C, ~1.5% H2O was added to the gas stream by
bubbling a premixed gas cylinder of 15% CO2, 4% O2 and balance N2 (Praxair)
through a glass bubbler containing distilled water.


NEXAFS measurements. In situ NEXAFS measurements were performed with a
gas cell instrument previously described in detail elsewhere. Transmission-absorption
samples were prepared by suspending and sonicating ~10 mg of
mmen-Mg2(dobpdc) powder in ~1 ml of hexanes, and drop casting onto SiC membranes
(300 nm thickness). The metal-organic framework-coated membranes were then
loaded into a glass evacuation chamber that was slowly evacuated to 50 mTorr. The
chamber was then heated to 100 °C for 4 h to remove all guest species from the
pores. After heating, the chamber was cooled to ambient temperature and refilled
with dry nitrogen gas. Samples were transferred from the evacuation chamber to
the gas cell in a dry nitrogen glovebox to prevent exposure to any unwanted species.
N K-edge spectra of mmen-Mg2(dobpdc) were collected at beamline 6.3.2 (10¹¹
photons s⁻¹) at the Advanced Light Source at Lawrence Berkeley National Laboratory,
under vacuum and increasing pressures of CO2 gas. The experimental procedure
was the same as in earlier studies. The full pressure series of N K-edge spectra
are shown in Extended Data Fig. 3.

AIMD simulations. The equilibrium structure of mmen-Mg2(dobpdc) is obtained
by performing constant-pressure (NPT) ab initio molecular dynamics simulations
at room temperature. The system is equilibrated using a Parrinello-Rahman barostat
and a Langevin thermostat with a time step of 0.5 fs (refs 47, 48). A plane-wave
basis set that is truncated at 400 eV is used to represent the electronic wavefunctions
within the projector-augmented wave (PAW) approximation, as realized
in the Vienna Ab initio Simulation Package (VASP). The vdW-DF2 functional is used
to approximate the long-range dispersion forces, which are potentially important
for capturing the weak intermolecular interactions between the metal-organic framework
and the adsorbed molecules. Here, the vdW-DF2 correlation is computed
with the use of a 1 × 1 × 3 k-point grid to ensure that the electron density is converged
for a 22 Å × 21 Å × 7 Å supercell. On equilibration, eight time-separated
(that is, uncorrelated) snapshots are taken from the molecular dynamics trajectory
to establish an average spectrum from NEXAFS simulations.

XAS simulations. DFT calculations used for X-ray absorption spectroscopy (XAS)
simulations employ the PBE GGA functional. Plane-wave pseudopotential calculations
using ultrasoft pseudopotentials and a kinetic energy cutoff for electronic
wavefunctions (density) of 25 (200) Rydberg (Ry) were performed using the PWSCF
code within the Quantum-ESPRESSO package. The core-excited Kohn-Sham
eigenspectrum was generated using the eXcited electron and Core Hole (XCH)
approach. On the basis of a numerically converged self-consistent charge density,
we generated the unoccupied states for our XAS calculations non-self-consistently,
sufficiently sampling the first Brillouin zone with a 2 × 2 × 2 uniform k-point grid,
employing an efficient implementation of the Shirley interpolation scheme generalized
to handle ultrasoft pseudopotentials. Matrix elements were evaluated
within the PAW frozen-core approximation. Core-excited ultrasoft pseudopotentials
and corresponding atomic orbitals were generated with the Vanderbilt code.
Each computed transition was convoluted with a 0.2 eV Gaussian function to produce
continuous spectra.
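
The final broadening step can be pictured as replacing each computed transition (an energy and an intensity) with a Gaussian and summing the results on an energy grid. The sketch below makes two assumptions that the text does not settle: the 0.2 eV width is interpreted as a full width at half maximum, and each Gaussian is area-normalized so that the integrated intensity of a transition is preserved. The transition list is a hypothetical placeholder, not output of the calculations.

```python
import math


def broaden(transitions, energies, fwhm=0.2):
    """Sum of area-normalized Gaussians, one per (energy, strength) transition."""
    sigma = fwhm / (2.0 * math.sqrt(2.0 * math.log(2.0)))  # FWHM -> standard deviation
    norm = 1.0 / (sigma * math.sqrt(2.0 * math.pi))
    return [
        sum(s * norm * math.exp(-0.5 * ((e - e0) / sigma) ** 2) for e0, s in transitions)
        for e in energies
    ]


# Hypothetical stick spectrum: (transition energy in eV, relative strength).
transitions = [(402.3, 1.0), (405.0, 0.5)]

# Energy grid matching the plotted N K-edge window, 395 to 425 eV in 0.01 eV steps.
step = 0.01
energies = [395.0 + step * i for i in range(3001)]
spectrum = broaden(transitions, energies)

peak_energy = energies[spectrum.index(max(spectrum))]
area = sum(spectrum) * step  # simple Riemann sum of the broadened spectrum
```

In the procedure described here, spectra from several molecular dynamics snapshots would additionally be averaged after broadening.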

Periodic DFT calculations. Periodic DFT calculations in this work were performed
with the VASP 5.3.3 package. The energetics of CO2 capture by the alkylamine
moieties chemisorbed in the channels of M2(dobpdc) (M = Mg, Mn, Fe, Co, Ni,
Zn) were computed with the PBE and M06L functionals. The electron-ion interactions
in these calculations were described with the PAW method with an
energy cutoff of 550 eV. Atomic positions and lattice parameters were optimized
until the forces on all atoms were smaller than 0.02 eV Å⁻¹ at the Γ-point. On-site
Hubbard U corrections were employed for metal d electrons for M2(dobpdc)
(M = Mn, Fe, Co, Ni). The U values are determined to reproduce oxidation
energies in the respective metal oxides.

Modelled adsorption isotherms. To study the differences in adsorption behaviour
between the pair and chain models, we used a lattice model to predict the adsorption
isotherms. The lattice models are illustrated in Extended Data Fig. 5. The energy
at each lattice point is determined by the state of the lattice point and the state of the
surrounding lattice points. For both the pair and the chain model the interaction
energies, yellow and red, respectively, are taken directly from the DFT calculations.
The end of a chain (shown in blue) is assigned 80% of the chain energy, whereas a single
adsorbed CO2 (shown in green) is assigned 10% of the energy of a chain. Similarly, interactions
between two rows (cross-channel) are set to the pair energy (or 75% of a chain). An
amine without CO2 was not given an energy contribution. The effect of having different
metals was taken into account by varying the energy of the chain in accordance
with the DFT energies. To compute the isotherms we performed standard
grand-canonical Monte Carlo simulations. To compare the lattice model chemical
potential directly with the chemical potential of CO2 we used a shift of the pressure,
which was fitted to the steps of the isotherms at the highest and lowest temperatures.
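
To make the chain picture concrete, the following is a deliberately simplified grand-canonical Monte Carlo sketch. It is not the model used in this work: it replaces the two-dimensional lattice with six neighbours by a periodic one-dimensional row of amine sites, omits the pair and cross-channel terms, and uses reduced units with a hypothetical chain energy and temperature. Only the 10%/80%/100% energy ratios for isolated, chain-end and mid-chain sites follow the description above.

```python
import math
import random

E_CHAIN = -1.0  # energy of a mid-chain site, reduced units (hypothetical value)
SITE_ENERGY = {
    0: 0.10 * E_CHAIN,  # isolated adsorbed CO2: 10% of the chain energy
    1: 0.80 * E_CHAIN,  # chain end: 80% of the chain energy
    2: 1.00 * E_CHAIN,  # mid-chain site
}


def local_energy(occ, i):
    """Energy of sites i-1, i and i+1 on a periodic 1D lattice."""
    n = len(occ)
    total = 0.0
    for j in (i - 1, i, i + 1):
        j %= n
        if occ[j]:
            neighbours = occ[(j - 1) % n] + occ[(j + 1) % n]
            total += SITE_ENERGY[neighbours]
    return total


def coverage(mu, kT=0.05, n_sites=60, sweeps=200, seed=0):
    """Fraction of occupied sites in the final configuration at chemical potential mu."""
    rng = random.Random(seed)
    occ = [0] * n_sites
    for _ in range(sweeps * n_sites):
        i = rng.randrange(n_sites)
        before = local_energy(occ, i)
        occ[i] ^= 1                                   # trial insertion or removal
        d_n = 1 if occ[i] else -1
        delta = local_energy(occ, i) - before - mu * d_n
        # Metropolis criterion on E - mu*N; undo the move if it is rejected.
        if delta > 0 and rng.random() >= math.exp(-delta / kT):
            occ[i] ^= 1
    return sum(occ) / n_sites


low = coverage(mu=-3.0)   # chemical potential far below the chain energy
high = coverage(mu=0.0)   # chemical potential far above the chain energy
```

A production calculation would average the coverage over many sweeps after equilibration and scan the chemical potential finely; this sketch only returns the final snapshot at two extreme values.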
Extended Data Figure 1 | Powder X-ray diffraction. a-c, Rietveld refinement
of mmen-Mn2(dobpdc) at 100 K (a), CO2-mmen-Mn2(dobpdc) at 100 K
(b) and CO2-mmen-Mn2(dobpdc) at 295 K (c). The blue and red lines
represent the experimental and calculated diffraction patterns, respectively;
the grey line represents the difference between experimental and calculated
patterns; the green tick marks represent the calculated Bragg peak positions.
d, Plot of the diffraction data for CO2-mmen-Mn2(dobpdc) at 100 K (blue),
where the calculated pattern (red) is based on the mmen-Mn2(dobpdc)
structural model. The grey line represents the difference between the
experimental and calculated patterns. Note that the intensity differences
indicate a structural transition when CO2 is adsorbed. e, Fourier difference map
for mmen-Mn2(dobpdc) at 100 K. Purple, brown and red spheres represent
Mn, C and O atoms, respectively; yellow blobs represent excess electron
density that is not accounted for in the Mn2(dobpdc) structural model and
that is due to the mmen bound to each Mn²⁺ site. f, The coordination
environment around Mg²⁺ in the active form of the Rubisco enzyme is
structurally similar to the coordination environment around the metal cations
of mmen-Mg2(dobpdc) after adsorption of CO2.

Extended Data Figure 2 | Infrared spectroscopy. a, On cooling
mmen-Mg2(dobpdc) from 150 °C (red) to 30 °C (blue) under 5% CO2, changes to the
aliphatic C-H vibrations of mmen are apparent on adsorption of CO2.
Furthermore, ammonium formation from neutral secondary amines is
indicated by the appearance of a new, broad feature centred on 3,000 cm⁻¹.
b, Room temperature, in situ infrared spectroscopy measurements of
mmen-M2(dobpdc) (M = Mg, Mn, Fe, Co, Zn, Ni) under N2 (blue) and CO2 (green)
atmospheres. Grey lines mark diagnostic carbamate bands at 1,690 and
1,334 cm⁻¹. c, On cooling under flowing 5% CO2 in N2 from 150 to 30 °C,
the normalized mass increase of mmen-Mg2(dobpdc) measured by
thermogravimetric analysis (black line) can be compared with the normalized
integrated area of the infrared active bands at 1,330 and 660 cm⁻¹. The bands at
1,330 and 660 cm⁻¹ can be assigned to ν(C-N) and [β(OCO) + β(NCO)]
modes that are characteristic of the highly ordered ammonium carbamate
chains. d, In contrast to carbamate that is coordinated to a metal site through a
nitrogen atom (top), changes in electron resonance configurations give rise
to a feature at 1,334 cm⁻¹ characteristic of CO2 insertion into the
metal-nitrogen bond (bottom). e, Infrared spectroscopy clearly indicates that CO2
adsorption via metal-amine insertion occurs even in the presence of H2O owing
to the presence of the aforementioned band at 1,334 cm⁻¹.


©2015 Macmillan Publishers Limited. All rights reserved 


ARTICLE 


[Figure: N K-edge spectra. x axis: Energy (eV), 395 to 425; y axis: Normalized
intensity. Traces: Activated, 326 mTorr, 1.2 Torr, 3.0 Torr, 6.9 Torr, 10.9 Torr,
21.3 Torr, 36.1 Torr.]


Extended Data Figure 3 | X-ray absorption spectroscopy. Experimental N
K-edge NEXAFS spectra of mmen-Mg2(dobpdc) in vacuum and under
increasing pressures of CO2 gas. The broad feature between 411 and 419 eV, a
signature of N-C bond formation, appears before the pre-edge peak at
402.3 eV, which is characteristic of CO2 insertion.

Extended Data Figure 4 | Thermodynamics of CO2 adsorption. a, Isosteric
heat of CO2 adsorption plots for the mmen-M2(dobpdc) series. b, Plot of the
entropy of gaseous CO2 against pressure at 298 K. c, A linear correlation
was found to exist for each mmen-M2(dobpdc) material between the step
pressure at any temperature and the gas-phase entropy of CO2, for
mmen-Mg2(dobpdc) (green, R² = 0.99946), mmen-Mn2(dobpdc) (blue,
R² = 0.99918), mmen-Fe2(dobpdc) (orange, R² = 0.99934),
mmen-Co2(dobpdc) (purple, R² = 0.99244) and mmen-Zn2(dobpdc) (black,
R² = 0.99932).

Extended Data Figure 5 | Theoretical calculations. a, Representation of the
mapping of the hexagonal channel to a two-dimensional lattice in which each
site consists of an amine that can interact with six neighbouring sites. Each
amine can have one or zero CO2 molecules adsorbed. A single site with a
CO2 adsorbed is shown in green. Pairs are allowed to form in both the
crystallographic c direction and in the a-b plane (yellow); to model the chain
mechanism, an amine in the middle of the chain is shown in red, and the amine
at the end of the chain is in blue. b, Calculated CO2 adsorption isotherms
indicate that only a chain model of interactions rather than pairwise adsorption
interactions can give rise to the experimentally observed stepped isotherm shape.
c, On the basis of calculated adsorption enthalpies, the relative positions of
adsorption isotherms can be predicted from the chain model. d, e, DFT
calculations reflect the experimentally observed trend that CO2 adsorption
enthalpy (d) is related to the strength of the metal-amine bond, as reflected
by the calculated metal-amine bond length (e).




[Figure: panels a-c. a: x axis, P (bar), 0 to 50. b, c: x axes, Pressure (mbar).
Legend: Before H2O isotherms; After 40 °C H2O isotherm; After 75 °C H2O isotherm;
After 100 °C H2O isotherm.]
Extended Data Figure 6 | Volumetric gas adsorption. a, High-pressure
excess CO2 adsorption isotherm at 25 °C for mmen-Ni2(dobpdc) indicates that
Langmuir-type adsorption behaviour is maintained even at high pressures.
b, Isothermal adsorption measurements of water onto a sample of
mmen-Mg2(dobpdc) at 40, 75 and 100 °C. c, Four isothermal adsorption
measurements of CO2 at 75 °C onto a sample of mmen-Mg2(dobpdc)
before exposure to water and after water isotherms at 40, 75 and 100 °C. No
changes in the CO2 adsorption isotherms were apparent from exposure of the
sample to water.

Extended Data Figure 7 | Dynamic gas adsorption and regeneration energy.
a, Isobaric, variable temperature (ramp rate of 1 °C min⁻¹) gravimetric
adsorption experiments for mmen-Mg2(dobpdc) under 100% CO2. Cooling
from 150 to 30 °C is shown as the blue line, and heating from 30 to 150 °C as the
red line. Desorption hysteresis was minimal because the phase-transition
temperature and pressure are unchanged between adsorption and desorption.
b, Regeneration energy calculations for mmen-Mg2(dobpdc) (green) and
mmen-Mn2(dobpdc) (blue) indicate that effecting adsorption at high
temperatures can be considerably more efficient than adsorption at 40 °C.
c, Transient breakthrough of 15% CO2 (green), 4% O2 (red), 1.5% water (black)
and balance N2 (blue) through an adsorbent bed packed with
mmen-Mg2(dobpdc) at 25 °C. The adsorbent bed was under Ar (purple) before
adsorption; a breakthrough CO2 capacity of 2.7 mmol g⁻¹ was calculated.

[Figure: panel b. x axis: Adsorption T (°C), 40 to 120; y axis: Regeneration
Energy (MJ/kg). Reference line: MEA Baseline.]

Fine-scale genetic variation between human populations is interesting as a signature of historical demographic events
and because of its potential for confounding disease studies. We use haplotype-based statistical methods to analyse 
genome-wide single nucleotide polymorphism (SNP) data from a carefully chosen geographically diverse sample of 
2,039 individuals from the United Kingdom. This reveals a rich and detailed pattern of genetic differentiation with 
remarkable concordance between genetic clusters and geography. The regional genetic differentiation and differing 
patterns of shared ancestry with 6,209 individuals from across Europe carry clear signals of historical demographic 
events. We estimate the genetic contribution to southeastern England from Anglo-Saxon migrations to be under half, 
and identify the regions not carrying genetic material from these migrations. We suggest significant pre-Roman but 
post-Mesolithic movement into southeastern England from continental Europe, and show that in non-Saxon parts of 
the United Kingdom, there exist genetically differentiated subgroups rather than a general ‘Celtic’ population. 


The genetic composition of human populations varies throughout the
world, as a result of the interplay between population movement, admixture,
natural selection and genetic drift. Characterizing such geographical
population structure provides insights into demographic history
and is critical to genetic studies of disease.

Human population structure is reasonably well understood at broad
scales, for example between and within continents. Here we investigate
structure over much finer scales, in Caucasians within the United
Kingdom (UK) consisting of England, Scotland, Wales and Northern
Ireland. We use ‘Britain’ (technically Great Britain) to refer to the single
island consisting of modern-day England, Scotland and Wales. UK population
structure has been studied before, typically on relatively small samples
using various single-locus systems and recently genome-wide SNP
data. These earlier studies show some regional variation at particular
loci, with a weak, roughly north-south cline in allele frequencies genome-wide,
suggesting that population structure in the UK is rather limited.


Samples and analysis 


To investigate fine-scale population structure in the UK, and to provide
well-characterized controls for disease studies, we assembled a
sample, the People of the British Isles (PoBI) collection, as previously
described. Our analyses used 2,039 PoBI samples from rural areas
within the UK, genotyped as part of the Wellcome Trust Case Control Consortium
2 (WTCCC2), who had all four grandparents born within 80 km
of each other. We thus effectively sample DNA from the grandparents.
The grandparents’ average year of birth was 1885 (s.d. 18 years). As
the DNA from each PoBI participant is a random sample of their
grandparents’ DNA, our approach allows investigation of fine-scale
population structure in rural areas of the UK before the major population
movements of the twentieth century.

To provide context for the UK samples, we analysed 6,209 samples
from 10 countries in continental Europe genotyped in the WTCCC2
study of multiple sclerosis. To ensure compatibility between the PoBI
and continental European samples we restricted attention to autosomal
SNPs genotyped in both samples (approximately 500,000 SNPs,
see Methods).


Fine-scale UK population differentiation 


Consistent with earlier studies of the UK, population structure within
the PoBI collection is very limited. The average of the pairwise FST estimates
between each of the 30 sample collection districts is 0.0007, with
a maximum of 0.003 (Supplementary Table 1).
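
For orientation, an average over all pairs of 30 districts is an average over 435 district-to-district comparisons. The short sketch below shows how such a summary could be tabulated; the dictionary of FST values is filled with placeholder numbers purely for illustration and is not data from this study, and the function name is hypothetical.

```python
from itertools import combinations

N_DISTRICTS = 30
# Every unordered pair of distinct districts, identified here by index.
pairs = list(combinations(range(N_DISTRICTS), 2))


def summarize(fst_by_pair):
    """Return (mean, maximum) of the pairwise estimates."""
    values = [fst_by_pair[p] for p in pairs]
    return sum(values) / len(values), max(values)


# Placeholder values only (not estimates from the study).
fst_by_pair = {p: 0.0005 for p in pairs}
fst_by_pair[(0, 1)] = 0.003

mean_fst, max_fst = summarize(fst_by_pair)
print(len(pairs), mean_fst, max_fst)
```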

Against this background of very limited structure within the UK, 
we applied a recently developed method for detecting fine-scale population
structure, fineSTRUCTURE, to the PoBI samples, to look for
more subtle effects. See Methods (also Extended Data Figs 1 and 2) for 
an informal description, details, interpretation under both discrete and 
isolation-by-distance models, assessment of convergence, and enhancements
to the algorithm as applied in this study. In contrast to commonly
used approaches such as principal components or ADMIXTURE,
fineSTRUCTURE explicitly models the correlation between nearby SNPs 
and uses extended multi-marker haplotypes throughout the genome. 
This substantially increases its power to detect subtle levels of genetic 
differentiation. 

The fineSTRUCTURE algorithm can divide samples into genetic 
clusters hierarchically, from coarser to finer levels of structuring. We 
applied fineSTRUCTURE to the PoBI samples’ genetic data without
reference to the known geographical locations. The genetic clustering 
can be assessed with respect to geography by plotting individuals on a 
map of the UK (at the centroid of their grandparents’ places of birth) 
and examining the inferred genetic clusters, for different levels of the 
hierarchical clustering. 

Figure 1 shows this map for 17 clusters, together with the tree show- 
ing how these clusters are related at coarser levels of the hierarchy. (There 
is nothing special about this level of clustering, but it is convenient for 
describing some of the main features of our analysis; Supplementary 
Fig. 1 depicts maps showing other levels of the hierarchical clustering.) 
The correspondence between the genetic clusters and geography is strik- 
ing: most of the genetic clusters are highly localized, with many occu- 
pying non-overlapping regions. Because the genetic clustering made 
no reference to the geographical location of the samples, the resulting 
correspondence between genetic clusters and geography reassures us 
that our approach is detecting real population differentiation at fine 
scales. Our approach can separate groups in close proximity, such as in 
Cornwall and Devon in southwest England, where the genetic clusters 
closely match the modern county boundaries, or in Orkney, off the 
north coast of Scotland. 
It is instructive to consider the tree that describes the hierarchical
splitting of the 2,039 genotyped individuals into successively finer clus- 
ters (Fig. 1). The coarsest level of genetic differentiation (that is, the 
assignment into two clusters) separates the samples in Orkney from all 
others. Next the Welsh samples separate from the other non-Orkney 
samples. Subsequent splits reveal more subtle differentiation (reflected 
in the shorter distances between branches), including separation of north 
and south Wales, then separation of the north of England, Scotland and 
Northern Ireland from the rest of England, and separation of samples 
in Cornwall from the large English cluster. There is a single large cluster 
(red squares) that covers most of central and southern England and 
extends up the east coast. Notably, even at the finest level of differen- 
tiation returned by fineSTRUCTURE (53 clusters), this cluster remains 
largely intact and contains almost half the individuals (1,006) in our 
study. 

Although larger than those between the sampling locations, estimated F_ST
values between the clusters represented in Fig. 1 are small (average 0.002,
maximum 0.007, Supplementary Table 2), confirming that differentiation
is subtle. On the other hand, all comparisons between pairs of
clusters of their patterns of ancestry, as estimated by fineSTRUCTURE,
show highly significant differences (Supplementary Table 3).
Figure 1 | Clustering of the 2,039
UK individuals into 17 clusters 
based only on genetic data. For each 
individual, the coloured symbol 
representing the genetic cluster to 
which the individual is assigned is 
plotted at the centroid of their 
grandparents’ birthplaces. Cluster 
names are in side-bars and ellipses 
give an informal sense of the range of 
each cluster (see Methods). No 
relationship between clusters is 
implied by the colours/symbols. 

The tree (top right) depicts the order 
of the hierarchical merging of 
clusters (see Methods for the 
interpretation of branch lengths). 
Contains OS data © Crown 
We compared our approach to two widely used analysis tools, namely
principal components and ADMIXTURE (Extended Data Fig. 3).
Both approaches broadly separate samples from Wales and from Orkney, 
but are not able to distinguish many of the other clusters found by 
fineSTRUCTURE. We also performed analyses to show that the clus- 
tering is not an artefact of our sampling scheme preferentially selecting 
related individuals (see Methods, Extended Data Fig. 4 and Supplemen- 
tary Note). 


UK clusters in relation to Europe 

Genetic differences between UK clusters might in part reflect their rela- 
tive isolation from each other, and in part differing patterns of migra- 
tion and admixture from populations outside the UK. To gain further 
insight into this second aspect, we first applied similar fineSTRUCTURE 
analyses to 6,209 samples from continental Europe (henceforth referred 
to as ‘Europe’, see Extended Data Fig. 5a for the distribution of the 
samples by region), and then characterized the genetic composition of 
the UK clusters with respect to the genetic groups we found in Europe. 
A fuller analysis of the clustering within Europe and its interpretation 
will be described elsewhere. 

To avoid confusion below, we will refer to each of the 17 sets of indi- 
viduals defined by our fineSTRUCTURE analyses in the UK as a ‘cluster’, 
and to each of the sets of individuals defined in our analyses of Europe 
as a ‘group’. We focus in these analyses on the division of the European 
samples into 51 such groups (Extended Data Fig. 5b). We italicise names 
of UK clusters, to distinguish them from the geographic region (for 
example, the pink cross cluster Cornwall, and the county Cornwall). 
European groups are each given a unique identifying number (these 
are consecutive at the finest level of clustering, but not at the level we 
consider). In the text, groups are identified by this number and, for 
clarity, a three-letter label identifying the country (or countries) where 
the group is mainly represented (for example, GER6 for the group labelled 
‘6’, which is mostly found in Germany). 
For each UK cluster we estimated an ‘ancestry profile’, which characterizes
the ancestry of the cluster as a mixture of the ancestry of
the 51 European groups (see Methods for details, also Supplementary
Table 4). As for the fineSTRUCTURE clustering, these analyses use no
geographical information. The estimated ancestry profiles are illustrated 
in Fig. 2 which also depicts the sampling locations in Europe of the 
groups contributing to the ancestry profiles (see also Extended Data 
Fig. 6a). Note that it is possible for distinct clusters within the UK to 
have very similar ancestry profiles: for example, two UK regions could 
receive similar contributions from a set of European groups (thus similar 
ancestry profiles) but then evolve separately (leading to different pat- 
terns of shared ancestry within and between the regions, and thus to 
distinct clusters in fineSTRUCTURE).

The bar charts in Fig. 2 show that some European groups feature 
substantially in the ancestry profiles of all UK clusters. These are: GER6 
(yellow green) found predominantly in western Germany; BEL11 (green), 
in the northern, Flemish, part of Belgium; FRA14 (light blue), in north- 
west France; DEN18 (dark blue), in Denmark; SFS31 (blue/purple) in 
southern France and Spain. In contrast, some European groups feature 
substantially in the ancestry profiles of some UK clusters but are absent 
from those of other UK clusters: GER3 (orange), in northern Germany; 
FRA12 (dark green), in France; and FRA17 (blue), also in France. Two 
Swedish groups (SWE117 and SWE121) feature in the ancestry profiles 
of the UK clusters, with Norwegian groups (shades of purple) featuring 
substantially in the ancestry profiles of the Orkney clusters, and to a 
lesser extent the clusters involving Scotland and Northern Ireland. 


Discussion 


The application of powerful haplotype-based analysis methods to genome-wide
SNP data from a large, carefully collected UK sample reveals a
rich pattern of subtle fine-scale genetic differentiation within the UK, 
which shows a marked concordance with geography. Few of these details 
have been captured previously. 


Figure 2 | European ancestry 
profiles for the 17 UK clusters. Each 
row represents one of the 51 
European groups (labels at right) that 
were inferred by clustering the 
6,209 European samples using
fineSTRUCTURE. Only European 
groups that make at least 2.5% 
contribution to the ancestry profile of 
at least one UK cluster are shown. 
Each column represents a UK 
cluster. Coloured bars have heights 
representing the proportion 

of the UK cluster’s ancestry best 
represented by that of the European 
group labelled with that colour. 

The map shows the location 

(when known at regional level) 

of the samples assigned to each 
European group (some sample 
locations are jittered and/or moved 
for clarity, see Methods). Lines join 
group labels to the centroid of the 
group, or collection of groups 
The clustering (Fig. 1 and Supplementary Fig. 1) is notable both for
its exquisite differentiation over small distances and the stability of 
some clusters over very large distances. Genetic differentiation within 
the UK is not related in a simple way to geographical distance. Examples 
of fine-scale differentiation include the separation of: islands within 
Orkney; Devon from Cornwall; and the Welsh/English borders from 
surrounding areas. The edges between clusters follow natural geograph- 
ical boundaries in some instances, for example, between Devon and 
Cornwall (boundaries the Tamar Estuary and Bodmin Moor), and 
Orkney is separated by sea from Scotland. However, in many instances 
clusters span geographic boundaries; for example, the clusters in Nor- 
thern Ireland span the sea to Scotland. 

Although the branch lengths of the hierarchical clustering tree in 
Fig. 1 are not easy to interpret directly, they are indicative of the relative 
differentiation between UK clusters, so that for example, the differences 
between Orkney, Wales and the remainder of the UK are substantial 
compared to some of the finer differences (splits closer to the tips of 
the tree). North and south Wales are about as distinct genetically from 
each other as are central and southern England from northern England
and Scotland, and the genetic differences between Cornwall and Devon 
are comparable to or greater than those between northern English and 
Scottish samples, and to those between islands in Orkney. 

To facilitate further discussion, Fig. 3 and Extended Data Fig. 7 give 
an overview of the major population groups and movements of people 
within and into the UK at different times, based on archaeological, histor- 
ical and linguistic evidence. For more detail see the Supplementary Note. 

Our observation that samples in Orkney differ genetically from those 
in the rest of the UK has been noted before and is consistent with
the historical settlement and long-term control of Orkney by Norse
Vikings (Orkney was a part of Norway from 875 to 1472). Further, the
estimated ancestry profiles of the Orkney clusters show substantial 
contributions from groups in Norway (Fig. 2). This consistency with 
history and archaeology provides external validation of our approach. 

Our approach is clearly powered to detect quite subtle levels of pop- 
ulation structure. Not finding such structure in central and southern 
England is thus informative. Although some structure may exist within 


Figure 3 | Major events in the 
peopling of the British Isles. See 
Supplementary Note for further 
details. a, The routes taken by the first 
settlers after the last ice age. b, Britain 
during the period of Roman rule. 

c, The regions of ancient British, 
Irish and Saxon control. d, The 
migrations of Norse and Danish 
Vikings. The main regions of Norse 
Viking (light brown) and Danish 
Viking (light blue) settlement are 
shown. © EuroGeographics for the 
administrative boundaries 
(coastlines). 


[Fig. 3 map labels: AD 43-410; Hadrian’s Wall; Romano-British; Most heavily Romanized; Romano-Gallic]

©2015 Macmillan Publishers Limited. All rights reserved 


this region, there must have been sufficient movement of people, and 
hence of their DNA, since the last major invasions of the UK to make it 
relatively homogeneous genetically. This does not require large-scale 
population movements; it could be achieved by relatively local migra- 
tion over many generations. This region of Britain lacks major geograph- 
ical and (for the most part since the Roman occupation) geo-political 
barriers to human movement. 

Other UK clusters may well reflect historical events. For example, 
several genetic clusters in Fig. 1 match the geo-political boundaries in 
Fig. 3c, and may represent remnants of communities/kingdoms pre- 
sent after the Saxon migrations, while the cluster spanning Northern 
Ireland and southern Scotland may reflect the ‘Ulster plantations’. The
Supplementary Note contains further observations relating to the gen- 
etic clustering. 

Relative isolation has clearly been a major determinant of fine-scale 
population structure within the UK. To assess the role of a different 
possible cause, namely differential migration into different parts of the 
UK, we estimated European ancestry profiles for each of the UK genetic 
clusters (Fig. 2). Here we must use modern-day groupings, in Europe 
and the UK, as surrogates for the sources and results of major migra- 
tion events. Population movements between these events and the pre- 
sent, involving either the source populations or recipient groups, will 
attenuate signals of the original migration. For this and other reasons, it 
is hard to provide definitive explanations for our observations. None- 
theless, genetic differences persist through many generations and where 
we can check our conclusions against historical evidence, there is good 
concordance. In what follows we focus on the most likely explanations 
for various observations. See Supplementary Note for a fuller discus- 
sion. For definiteness, we focus on the clustering in Fig. 1 and Extended 
Data Fig. 5b, although other levels are informative. Analysis of addi- 
tional UK and European samples, particularly in regions where our 
data are sparse (for example, central Wales and Scotland, Spain, the 
Netherlands) would improve our ability to infer historical events. 

The observation (Fig. 2 and Supplementary Table 4) that particular 
European groups (for example, GER3, FRA12, FRA17) contribute sub- 
stantially to the ancestry profiles of some, but not all, UK clusters strongly 
suggests that at least some of the structure we observe in the UK results 
from differential input of DNA to different parts of the UK: the absence 
in particular UK clusters of ancestry from specific European groups is 
best explained by the DNA from those European groups never reaching 
those UK clusters. A critical observation which follows is that groups 
which contribute significantly to the ancestry profiles of all UK clusters 
most probably represent, at least in part, migration events into the UK 
that are relatively old, since their DNA had time to spread throughout 
the UK. Conversely, groups that contribute to the ancestry profiles of 
only some UK clusters most probably represent more recent migration 
events, with the resulting DNA not yet spread throughout the UK by 
internal migration. ‘Old’ and ‘recent’ here are relative terms—we can 
infer the order of some events in this way but not their absolute times. 
Although we refer to migration events, we cannot distinguish between 
movements of reasonable numbers of people over a short time or on- 
going movements of smaller numbers over longer periods. 

Applying this approach suggests a relative ordering of the peopling 
of the British Isles. For a full discussion, and caveats, see Supplemen- 
tary Note. Briefly, the earliest migrations whose descendants survive 
to make a substantial contribution to the present population are best 
captured by three groups in our European analyses, GER6 (western 
Germany), BEL11 (Belgium), and FRA14 (north-western France). These 
groups still contribute to current patterns of population differentiation 
(Fig. 2, see also Extended Data Fig. 6). Other European groups may reflect 
early migrations into the UK, but with smaller contribution, including 
SFS31 (southern France/Spain), at least part of DEN18 (Denmark), and 
possibly parts of Norway and Sweden. A subsequent migration, best 
captured by FRA17 (France), contributed a substantial amount of ances- 
try to the UK outside Wales. Although we cannot formally exclude this 
being part of the Saxon migration, this seems unlikely (see Methods) 
and instead it might represent movement of people taking place between
the early migrations and those known from historical records. Migra- 
tions represented by FRA12 essentially only affect Wales and Northern 
Ireland and/or Scotland. We also see clear signals of some of the known 
historical migrations and settlements, including the Saxons (GER3, 
northern Germany, and probably much of DEN18, Denmark) and the 
Norse Vikings (NOR53-NOR90). 

To further shed light on two major migration events, in Orkney and 
in central and southern England respectively, we applied a distinct analytical
tool, GLOBETROTTER (Extended Data Figs 8 and 9). Informally,
GLOBETROTTER exploits information in the rate of decay of
shared haplotype segments to test for the presence of recent admixture, 
to identify groups contributing, and then date the admixture. 

GLOBETROTTER detected strong evidence (P < 0.01) that the
largest Orkney cluster (Orkney 1) was influenced by a recent admixture 
event with an overall contribution of ~25% of the DNA from groups in 
Norway, confirming that the Norwegian contribution in the ancestry 
profile for this cluster reflects recent admixture (Extended Data Fig. 9). 
The approach assumed the simplest model (a single pulse of admixture),
and estimated this to have occurred 29 generations ago (95% confidence
interval (CI): 18-39 generations), corresponding to year 1100
(95% CI: 830-1418), assuming a 28-year generation time; no clear
evidence was found of multiple admixture dates. We expect less precise 
estimates for the other two Orkney clusters (due to their smaller sample 
size), but these were consistent with those for Orkney 1. For Cent./S 
England the method also detected an admixture event, with a contribu- 
tion of ~35% of DNA from GER3, the group in north-western Germany, 
and an estimated date of 38 generations (95% CI: 36-40 generations), 
corresponding to year 858 (95% CI: 802-914) (Extended Data Fig. 9). 
The GLOBETROTTER analyses detect likely source populations for 
the known historical migrations (Norse Vikings and Saxons, respec- 
tively) with the estimated proportion contributed by these sources close 
to that estimated in the ancestry profiles. Note that a migration event is 
likely to precede any subsequent population admixture, possibly sub- 
stantially so, if the migrants mate largely within the migrant group for 
some time after their migration. Further, admixture is likely to be a 
gradual process, so that using a model of a single pulse of admixture in 
GLOBETROTTER is likely to estimate a time after the commence- 
ment of admixture. For these reasons, the admixture dates estimated by 
GLOBETROTTER should provide upper bounds on the dates of the 
migrations, as for both examples here, where the estimated dates are
200 or more years after the known dates of the migrations, suggesting 
that the mixing was indeed a gradual process. 
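
The conversion from generations to calendar years can be sketched as follows. The 28-year generation time is stated in the text; the reference year of 1922 is an assumption, back-calculated from the reported figures (for example, 38 generations corresponding to year 858), and the function name is hypothetical.

```python
GENERATION_YEARS = 28   # generation time stated in the text
REFERENCE_YEAR = 1922   # assumption: inferred from the reported dates, not stated

def generations_to_year(generations,
                        generation_years=GENERATION_YEARS,
                        reference_year=REFERENCE_YEAR):
    """Calendar year lying `generations` generations before the reference year."""
    return reference_year - generations * generation_years

# Cent./S England estimate and its 95% CI (36-40 generations).
england = [generations_to_year(g) for g in (38, 36, 40)]

# Orkney 1 confidence limits (18 and 39 generations).
orkney_ci = [generations_to_year(g) for g in (18, 39)]
```

Under this assumed reference year the England estimate and both sets of confidence limits match the reported years. The Orkney point estimate of 29 generations maps to 1110 rather than the reported 1100, presumably because the published generation count is rounded.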

After the Saxon migrations, the language, place names, cereal crops 
and pottery styles all changed from that of the existing (Romano-British) 
population to those of the Saxon migrants. There has been ongoing 
historical and archaeological controversy about the extent to which the 
Saxons replaced the existing Romano-British populations. Earlier genetic 
analyses, based on limited samples and specific loci, gave conflicting 
results. With genome-wide data we can resolve this debate. Two sepa- 
rate analyses (ancestry profiles and GLOBETROTTER) show clear evi- 
dence in modern England of the Saxon migration, but each limits the 
proportion of Saxon ancestry, clearly excluding the possibility of long- 
term Saxon replacement. We estimate the proportion of Saxon ances- 
try in Cent./S England as very likely to be under 50%, and most likely in 
the range of 10-40%. 

A more general conclusion of our analyses is that while many of the 
historical migration events leave signals in our data, they have had a 
smaller effect on the genetic composition of UK populations than has 
sometimes been argued. In particular, we see no clear genetic evidence 
of the Danish Viking occupation and control of a large part of England,
either in separate UK clusters in that region, or in estimated ancestry 
profiles, suggesting a relatively limited input of DNA from the Danish 
Vikings and subsequent mixing with nearby regions, and clear evidence 
for only a minority Norse contribution (about 25%) to the current 
Orkney population. 
We saw no evidence of a general ‘Celtic’ population in non-Saxon
parts of the UK. Instead there were many distinct genetic clusters in 
these regions, some amongst the most different in our study, in the 
sense of being most separated in the hierarchical clustering tree in Fig. 1. 
Further, the ancestry profile of Cornwall (perhaps expected to resemble 
other Celtic clusters) is quite different from that of the Welsh clusters, 
and much closer to that of Devon, and Cent./S England. However, the 
data do suggest that the Welsh clusters represent populations that are 
more similar to the early post-Ice-Age settlers of Britain than those 
from elsewhere in the UK. 

In summary, we have presented the first (to our knowledge) fine- 
scale dissection of subtle levels of genetic differentiation within a coun- 
try, by using careful sampling, genomic data and powerful statistical 
methods. The resulting genetic clusters, and the characterization of their 
ancestry in terms of European groups, provide important and novel 
insights into the peopling of the British Isles. 

Genetic information can augment archaeological, linguistic and his- 
torical approaches to understanding population history. It also com- 
plements them, in providing evidence relating to the bulk of ordinary 
people rather than the successful elite. We hope that our study will act 
as a proof-of-principle for the power of such detailed genetic analyses. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS


Samples, genotyping and QC. The sampling scheme and general information
about the UK sample are described elsewhere. Briefly, the aim was to collect samples
from individuals in rural regions of the UK whose four grandparents were all born close
to each other. In total 4,371 samples were collected as part of the PoBI project. Of
these, 2,886 were genotyped on the Illumina Human 1.2M-Duo genotyping chip
as part of the Wellcome Trust Case Control Consortium 2 (WTCCC2) studies, with
2,510 passing the WTCCC2 genotype quality control (QC) procedures. We then
applied a geographic filter, which imposed a maximum pairwise distance between 
each sample’s grandparents’ places of birth of 80 km, leaving 2,039 samples avail- 
able for analysis. In what follows we refer to these samples as the ‘UK sample(s)’. We 
give a detailed description of the choice of SNPs used for our analyses below. 
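
The geographic filter, and the centroid used to place each individual on the map, can be sketched as below. This is a minimal illustration assuming birthplaces are given as (latitude, longitude) pairs in degrees and that distance is measured along a great circle; the text does not specify how distances were computed, and all function names are hypothetical.

```python
import math
from itertools import combinations

EARTH_RADIUS_KM = 6371.0  # mean Earth radius (assumption of this sketch)

def haversine_km(a, b):
    """Great-circle distance in km between two (lat, lon) points in degrees."""
    lat1, lon1, lat2, lon2 = map(math.radians, (a[0], a[1], b[0], b[1]))
    h = (math.sin((lat2 - lat1) / 2) ** 2
         + math.cos(lat1) * math.cos(lat2) * math.sin((lon2 - lon1) / 2) ** 2)
    return 2 * EARTH_RADIUS_KM * math.asin(math.sqrt(h))

def passes_geographic_filter(birthplaces, max_km=80.0):
    """True if every pair of grandparental birthplaces is within max_km."""
    return all(haversine_km(p, q) <= max_km
               for p, q in combinations(birthplaces, 2))

def centroid(birthplaces):
    """Plain mean of latitudes and longitudes (adequate over ~80 km)."""
    n = len(birthplaces)
    return (sum(p[0] for p in birthplaces) / n,
            sum(p[1] for p in birthplaces) / n)

# Illustrative, approximate coordinates for four places in Cornwall.
cornwall = [(50.263, -5.051), (50.152, -5.066),
            (50.118, -5.537), (50.338, -4.795)]
# Replacing one grandparent with a birthplace in Orkney breaks the filter.
mixed = cornwall[:3] + [(58.98, -2.96)]
```

Checking all six pairs rather than the distance to the centroid matches the wording of the filter, which constrains the maximum pairwise distance.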

For the European ancestry profile analysis we used 6,209 samples from the 
WTCCC2 multiple sclerosis study, of which 5,682 were cases and 527 were controls.
We excluded all samples from the UK and Ireland (see ‘treatment of Eire’
below). Extended Data Fig. 5a shows a breakdown of sample numbers by region.
In the following text we refer to these continental European samples as the ‘European
sample(s)’. The European samples were genotyped on the Illumina Human 660-Quad
chip as previously described. These samples had already passed through
the WTCCC2 SNP and sample quality control procedures.

For all analyses we used intersections of the autosomal SNPs available for the
UK and European data sets, constructed in the following manner: we excluded
SNPs in the HLA region, and, for analyses involving the European samples, SNPs 
in major multiple sclerosis associated regions (although any effect of the use of 
disease samples should be small in analyses of genome-wide data). More specif- 
ically, we first took the full intersection of the UK and European data SNP sets. We 
removed a 15 Mb region surrounding the HLA region on chromosome 6 because 
the European samples consisted largely of multiple sclerosis cases, and multiple sclerosis
has strong HLA associations. This left 575,236 SNPs that were transferred to the
haplotype inference (phasing) step (see next section). Within the phasing software 
(IMPUTE2) further SNPs were excluded based on WTCCC2 quality control pro- 
cedures, which—in addition to IMPUTE2’s internal removal of SNPs due to strand 
issues or lack of overlap between the SNP array and the reference panel haplotypes— 
removed 15,211 of these SNPs before phasing. After phasing, SNPs with IMPUTE2 
info-threshold ≤ 0.975 and SNPs that were singletons among all phased data were
removed (these data include all PoBI and European samples). This left 524,699
SNPs. For the analyses using only the UK data (the clustering analysis labelled
‘analysis A’ in the next section) 522,862 SNPs were used in the CHROMOPAINTER/
fineSTRUCTURE analyses (see next section), as the rest were monomorphic in the 
UK set of 2,039 individuals. For further analyses, using the European data (labelled 
‘analysis B’ and ‘analysis C’ in the next section), multiple sclerosis associated SNPs 
(regions defined by linkage disequilibrium around major loci of suggestive asso- 
ciation with multiple sclerosis) were removed, as well as some other SNPs for tech- 
nical reasons. In total this removed SNPs from 56.8 Mb of the genome. This resulted 
in 515,981 SNPs remaining for the analyses involving European samples. In sum- 
mary, there were 522,862 SNPs available for the UK clustering analyses, and 
515,981 SNPs available for the analyses involving European samples. A complete 
list of rsIDs is available at (http://www.well.ox.ac.uk/POBI). 
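
The SNP counts quoted above can be cross-checked with simple arithmetic. The sketch below uses only figures stated in the text; the variable names are hypothetical, and the pre-phasing figure is treated as an upper bound because the text leaves open whether IMPUTE2's internal removals are included in the 15,211.

```python
# Counts stated in the text.
after_hla_removal = 575_236       # passed to the phasing step
removed_before_phasing = 15_211   # WTCCC2 QC exclusions within IMPUTE2
after_phasing_filters = 524_699   # after info and singleton filters
uk_analysis = 522_862             # analysis A
european_analysis = 515_981       # analyses B and C

# Derived quantities.
max_entering_phasing = after_hla_removal - removed_before_phasing
monomorphic_in_uk = after_phasing_filters - uk_analysis
removed_for_european = after_phasing_filters - european_analysis
```

On these figures, 1,837 SNPs were monomorphic in the 2,039 UK individuals, and 8,718 were dropped for the European analyses (multiple sclerosis regions plus technical exclusions).
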
Inference of population structure. To aid in understanding we give an informal 
description of the approach we applied for inferring fine-scale population struc- 
ture. This is followed by a more detailed elaboration of our analysis. A critical 
feature of the algorithm, unlike other common approaches to detecting population
structure such as principal components, ADMIXTURE or STRUCTURE,
is that it explicitly models the correlation structure amongst nearby SNPs due to 
linkage disequilibrium, making use of the information in extended multi-marker 
haplotypes throughout the genome. This adds substantially to fineSTRUCTURE’s 
power to detect subtle levels of genetic differentiation. It has been known since the 
early HLA studies that methods that account for linkage disequilibrium are more 
informative for studies of human population structure than approaches which treat 
each locus marginally.

Very informally, in the fineSTRUCTURE approach, haplotype phase was first 
inferred in each sample, after which each resulting haploid genome is broken into 
pieces, in such a way that for each piece the method identifies the homologous 
piece in another individual to which it is most similar. This can be thought of as 
identifying the other individual in the collection with the most similar ancestry 
for that part of the genome (the average size of these pieces varies across indivi- 
duals, but has median 0.51 cM with IQR 0.44-0.63 cM). For each individual, one 
can tally up the number of pieces over which its genome is closest to each other 
sampled individual. These individual vectors of similarity counts are then used to 
cluster together individuals with similar ancestries, using a model-based statistical 
algorithm (fineSTRUCTURE) fitted by Markov chain Monte Carlo. The choice as
to the number of clusters, and the assignment of individuals to clusters, is made so 
as to maximise the posterior probability under the probability model used for 
clustering in fineSTRUCTURE. In the PoBI analysis, this yields 53 clusters of individuals.
Similar clusters are then merged hierarchically to give a tree which can be
used to describe population structure at different levels of granularity, as we describe 
below. 
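
The tallying step in the informal description can be sketched as follows. This is a toy illustration only: it assumes the closest donor for each genome piece has already been identified, whereas CHROMOPAINTER infers this with a hidden Markov model. The function name and the toy data are hypothetical.

```python
from collections import Counter

def similarity_counts(nearest_donor_per_piece, all_ids):
    """Turn per-piece nearest donors into one count vector per individual.

    nearest_donor_per_piece: recipient id -> list holding, for each genome
        piece, the id of the other individual whose DNA is most similar.
    all_ids: fixed ordering of individuals used for the vector columns.
    """
    vectors = {}
    for recipient, donors in nearest_donor_per_piece.items():
        tally = Counter(donors)
        vectors[recipient] = [tally.get(d, 0) for d in all_ids]
    return vectors

# Toy data: four individuals, four genome pieces each.
toy = {
    "ind1": ["ind2", "ind2", "ind3", "ind2"],
    "ind2": ["ind1", "ind1", "ind1", "ind3"],
    "ind3": ["ind4", "ind4", "ind1", "ind4"],
    "ind4": ["ind3", "ind3", "ind3", "ind2"],
}
ids = ["ind1", "ind2", "ind3", "ind4"]
vectors = similarity_counts(toy, ids)
```

In the toy data, ind1 and ind2 mostly copy from each other, as do ind3 and ind4, so a clustering of these vectors would group them into two pairs.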

More formally, haplotypes were inferred (phased) jointly for all individuals 
used in the study (that is, the UK and European samples) with IMPUTE2, using
the default values (see http://mathgen.stats.ox.ac.uk/impute/impute_v2.html#mcmc_ 
options). The reference data used are available from the IMPUTE2 website (http:// 
mathgen.stats.ox.ac.uk/impute/impute_v2.html#reference). 

Next, we used the algorithm implemented in the CHROMOPAINTER program
to represent the DNA of individuals as mosaics of the DNA from other individuals. 

We performed three separate CHROMOPAINTER analyses: 

A. Form each haplotype of a UK individual as a mosaic of all UK haplotypes excluding
those of that individual.

B. Form each UK haplotype as a mosaic of all European haplotypes. 

C. Form each haplotype of a European individual as a mosaic of all European hap- 
lotypes excluding those of that individual. 
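
The three analyses differ only in which individuals may act as donors for a given recipient. A minimal sketch of that rule, with a hypothetical function name and identifiers, might look like this:

```python
def donor_set(analysis, recipient, uk_ids, european_ids):
    """Individuals whose haplotypes may be copied from in each analysis."""
    if analysis == "A":   # UK recipient, all other UK individuals as donors
        return [i for i in uk_ids if i != recipient]
    if analysis == "B":   # UK recipient, all European individuals as donors
        return list(european_ids)
    if analysis == "C":   # European recipient, all other Europeans as donors
        return [i for i in european_ids if i != recipient]
    raise ValueError(f"unknown analysis: {analysis!r}")

uk = ["uk1", "uk2", "uk3"]
eu = ["eu1", "eu2"]
```

Excluding the recipient in analyses A and C prevents an individual from being painted entirely as a copy of itself.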

For each analysis, A-C, we ran the algorithm implemented in CHROMOPAINTER
as recommended by the authors, except for a minor change to the value of a single 
parameter for analysis A, implemented for technical reasons. Specifically, we 
initially applied CHROMOPAINTER to a subset of individuals and chromosomes
(chosen as described below) using 10 iterations of its expectation-maximization 
(EM) algorithm to infer the genome-wide average switch and global emission 
rates in CHROMOPAINTER’s Hidden Markov model. We averaged the inferred 
values of each across the chromosomes and individuals used, weighting chromo- 
somes by their relative size, and fixed these final switch and global emission rates 
in a final run of CHROMOPAINTER on all individuals and chromosomes. This 
final CHROMOPAINTER run gave the final ‘counts’ and ‘lengths’ values used in 
all subsequent analyses. For analysis A, we inferred switch and global emission 
rates averaging across chromosomes 4, 10, 15, 22 (using weights of 187, 131, 81 and
34, respectively) and 20 individuals from each of 30 UK sample regions (counties 
or districts from which the PoBI samples were collected, from across the whole UK), 
starting with an initial switch rate of 400,000/(2N_UK), where N_UK is the number of
samples used for the UK analyses, and a default emission rate. For analyses B and C, 
we inferred switch and global emission rates averaging across chromosomes 1, 8, 
15, 22 (using weights of 219, 142, 81 and 34, respectively) for 20 individuals from each 
of 30 United Kingdom regions, and 20 individuals out of every 200 in a combined 
file of all European subjects, starting with an initial switch rate of 400,000/(2Nx), 
where Ng is the number of samples used for the European analyses, and a default 
emission rate. Previous work with CHROMOPAINTER has shown that deviations of 
the switch rate (even up to a factor of 10) have little effect on CHROMOPAINTER’s 
inference (data not shown). Finally, for analysis C, we set the expected number of 
haplotypic segments to define a region (that is, the ‘-k’ switch) to CHROMOPAINTER’s
default value of 100 in order to estimate a normalization parameter (denoted
by ‘c’) subsequently used by the clustering program fineSTRUCTURE (ref. 15). In contrast,
we set this value to 50 (that is, using ‘-k 50’) for analysis A. This slight
deviation from CHROMOPAINTER’s default value was implemented for analysis 
A because some UK individuals shared relatively long haplotype segments with 
other UK haplotypes, such that they did not always have 100 total such segments 
across the entirety of some of the smaller chromosomes. We used the June 2008 
build 36 genetic map from the HapMap webpage
(http://hapmap.ncbi.nlm.nih.gov/downloads/recombination/2008-03_rel22_B36/rates/).
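
The parameter-fixing step described above amounts to a weighted average. The following is a minimal sketch of that arithmetic, assuming the EM estimates are first averaged across individuals within each chromosome and then combined across chromosomes using the stated weights. The function name and the per-individual switch-rate values are hypothetical and serve only as illustration; only the weights for analysis A come from the text.

```python
def weighted_genome_average(per_chrom_values, weights):
    """Average EM estimates across individuals, then across chromosomes by weight.

    per_chrom_values: {chromosome: [estimate for each individual]}
    weights:          {chromosome: relative size weight}
    """
    total_weight = sum(weights[c] for c in per_chrom_values)
    weighted_sum = 0.0
    for chrom, values in per_chrom_values.items():
        chrom_mean = sum(values) / len(values)  # mean across individuals
        weighted_sum += weights[chrom] * chrom_mean
    return weighted_sum / total_weight


# Weights quoted in the text for analysis A (chromosomes 4, 10, 15 and 22).
weights_a = {4: 187, 10: 131, 15: 81, 22: 34}

# Hypothetical EM switch-rate estimates for two individuals per chromosome.
switch_rates = {
    4: [100.0, 120.0],
    10: [110.0, 130.0],
    15: [90.0, 110.0],
    22: [140.0, 160.0],
}

fixed_switch_rate = weighted_genome_average(switch_rates, weights_a)
```

The same function would be applied separately to the global emission rate before both values are fixed for the final run.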

CHROMOPAINTER provides estimates of the counts of haplotype segments and 
total length of DNA (in cM) for which an individual shares most recent common 
ancestry with a set of other individuals. When summed across all 22 autosomes we 
refer to the vector of these counts as the ‘copying profile’ for that individual. For 
example, in analysis B, CHROMOPAINTER gives the counts of haplotype seg- 
ments and total length of DNA for which each UK individual shares most recent 
common ancestry with each European individual. These values are given for
chromosomes 1-22 of each UK individual, and are also summed to give a genome-wide
total across the autosomes (in the case of the counts data, the copying profile). 
Furthermore, within a UK individual, these values can be summed across any group- 
ing of European individuals (for example those sampled from the same geographic 
region or assigned to the same European group, see below) providing an estimate 
of the counts of haplotype segments and/or total length of DNA for which each UK 
individual shares most recent common ancestry with any European group (a group 
copying profile). It is natural to average these values across UK individuals assigned 
to the same cluster (see below) to get average values for all UK individuals from a 
particular cluster: a ‘copying vector’ for the cluster as a whole.
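
The two summaries described in this paragraph (summing within donor groups, then averaging across the recipients of a cluster) can be sketched as follows. This is an illustrative sketch only: the dictionary layout, the function names and the toy numbers are assumptions and do not reflect CHROMOPAINTER's actual output format.

```python
def group_profile(individual_profile, donor_groups):
    """Sum one recipient's per-donor values within each donor group."""
    totals = {}
    for donor, value in individual_profile.items():
        group = donor_groups[donor]
        totals[group] = totals.get(group, 0.0) + value
    return totals


def cluster_copying_vector(group_profiles):
    """Average group profiles over the recipients assigned to one cluster."""
    groups = sorted({g for profile in group_profiles for g in profile})
    n = len(group_profiles)
    return {g: sum(p.get(g, 0.0) for p in group_profiles) / n for g in groups}


# Hypothetical donors e1..e3 assigned to two hypothetical groups G1 and G2.
donor_groups = {"e1": "G1", "e2": "G1", "e3": "G2"}

# Hypothetical genome-wide segment counts for two recipients in one cluster.
uk1 = {"e1": 10.0, "e2": 5.0, "e3": 5.0}
uk2 = {"e1": 4.0, "e2": 6.0, "e3": 10.0}

profiles = [group_profile(uk1, donor_groups), group_profile(uk2, donor_groups)]
copying_vector = cluster_copying_vector(profiles)
```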

For analyses A and C, described above, we used the algorithm implemented
in the program fineSTRUCTURE (ref. 15) to group the UK and European individuals,
respectively, into genetically relatively homogeneous clusters. The fineSTRUCTURE
program takes as its input the counts of haplotype segments for which each individual
shares recent common ancestry with every other as inferred by CHROMOPAINTER
(summed across all chromosomes, the copying profile). The choice to use counts
in this analysis is motivated by the underlying ‘painting’ model used by
CHROMOPAINTER, in which segments are shared with individuals chosen independently
from one another, and there is a constant switch rate between segments. Under 
this model, each segment provides an equal amount of independent information, 
while segment lengths are uninformative, so the segment counts provide a natural 
basis for inference, and this is why they are used. However, we note that in practice 
fineSTRUCTURE attempts to allow for departures from this modelling assump- 
tion (which is expected to only be an approximation) through a scaling parameter 
on the (log-) likelihood. Moreover we believe there is often useful information 
provided in, for example, the fact that segments shared between genuinely closely 
related groups tend to be longer on average, akin to the idea of long segments shared 
‘identical by descent’ with respect to some founder population. Exploring and using 
this length information may provide an interesting topic for future work. 

We initially put all of our individuals into a single cluster at iteration 0, but otherwise
used default values when running fineSTRUCTURE (see ref. 15 for details).
Each Markov Chain Monte Carlo (MCMC) iteration of fineSTRUCTURE provides 
the number of clusters and the cluster membership of each individual, sampled 
according to their posterior probabilities under the fineSTRUCTURE model. We 
sampled values every 10,000 iterations for 1 million MCMC iterations following 
either 1 million (analysis A) or 3 million (analysis C) ‘burn-in’ iterations. Starting
from the MCMC sample with the highest posterior probability among all samples, 
fineSTRUCTURE performed 100,000 additional hill-climbing moves to reach its 
final inferred state. 

Next we undertook an additional step to improve fineSTRUCTURE’s inference
for cluster membership. This is an addition to the fineSTRUCTURE algorithm (ref. 15).
While fineSTRUCTURE’s final inferred state has been shown to give reasonable
results in practice”, it relies heavily on a single MCMC sample observation. Although 
this single sample is the one with maximum posterior probability among all MCMC 
samples, the probability has been calculated assuming fixed (sampled) values for a 
large number of parameters that include the total number of clusters, each individual’s 
final inferred cluster assignment, and other modelling parameters. Therefore, a con- 
cern is that the posterior distribution will be relatively flat across such an extensive 
state space, such that fairly divergent parameter values may result in similar pos- 
terior probabilities. In contrast, the marginal posterior distribution of each indi- 
vidual’s cluster assignment across all MCMC runs should be substantially more 
informative, improving the assignment of individuals to clusters. Informally, by 
chance alone any given individual may not be in its own optimal (highest proba- 
bility) cluster in the final inferred state, despite the overall posterior probability 
being at its maximum. We thus seek to reassign any such individuals to their most 
probable cluster. We therefore leverage the marginal information of each indivi- 
dual’s cluster assignment from the values of the MCMC samples recorded every 
10,000 iterations (see above) in order to re-assign individuals to clusters. Specifi- 
cally, assuming we have N total individuals and M MCMC samples, and starting 
from the K clusters in fineSTRUCTURE’s ‘final inferred state’, we performed the 
following procedure: 

1. We find the number $x_i^m$ of individuals that cluster with individual i (including
individual i itself) in MCMC sample m, for i = 1,...,N and m = 1,...,M.

2. We furthermore find the number $y_{ik}^m \le x_i^m$ of individuals that both cluster
with individual i in MCMC sample m and that are in cluster k of the final inferred
state, for k = 1,...,K.

3. We re-assign each individual i to the cluster k with the maximum value of
$\sum_{m=1}^{M} [y_{ik}^m / x_i^m]$ across all k in 1,...,K. These re-assignments give a new final
inferred state; note these re-assignments can reduce the total number of clusters K.

4. We repeat steps 1-3 for 50 iterations.

This procedure gives the final cluster assignments for each individual. 
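
The four steps above can be sketched as follows. This is a minimal illustration, not the authors' implementation. It assumes that all individuals are re-assigned simultaneously within an iteration, that ties are broken by the smallest cluster label, and that the loop may stop early once the state no longer changes (which gives the same result as running all 50 iterations, because the scores depend only on the current state). The function name and the input layout (one list of cluster labels per MCMC sample) are hypothetical.

```python
from collections import Counter


def reassign(mcmc_samples, final_state, n_iter=50):
    """Re-assign individuals using co-clustering across MCMC samples.

    mcmc_samples: M lists, each holding a cluster label for all N individuals
    final_state:  N cluster labels (the starting 'final inferred state')
    Returns the new state and, per individual, normalised scores per cluster.
    """
    state = list(final_state)
    n = len(state)
    confidence = []
    for _ in range(n_iter):
        scores = [Counter() for _ in range(n)]
        for sample in mcmc_samples:
            members = {}  # label in this sample -> individuals carrying it
            for j, label in enumerate(sample):
                members.setdefault(label, []).append(j)
            for group in members.values():
                x = len(group)  # x_i^m, shared by every i in the group
                y = Counter(state[j] for j in group)  # y_ik^m for each k
                for i in group:
                    for k, count in y.items():
                        scores[i][k] += count / x
        # Normalise the summed ratios across k to obtain confidence values.
        confidence = []
        for s in scores:
            total = sum(s.values())
            confidence.append({k: v / total for k, v in s.items()})
        new_state = [max(sorted(s), key=s.get) for s in scores]
        if new_state == state:
            break
        state = new_state
    return state, confidence


# Toy example: individual 2 always co-clusters with 0 and 1 but starts in cluster 1.
samples = [[0, 0, 0, 1, 1, 1]] * 3
new_state, conf = reassign(samples, [0, 0, 1, 1, 1, 1])
```

In the toy example, individual 2 moves to cluster 0 because two of its three co-clustering partners (counting itself) sit in cluster 0 in every sample.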

One feature of this additional procedure used for reassigning individuals to 
clusters is that we obtain measurements of the confidence in the assignment of 
each individual i to each cluster k. For each individual i, the values of
$\sum_{m=1}^{M} [y_{ik}^m / x_i^m]$ from the final iteration can be normalized across k to sum to one,
and stored in the K-vector $P_{K,i}$. These quantities have a natural interpretation as a
measure of the confidence associated with the assignment of individual i to each
cluster k. Note that we assign individual i to the cluster k for which the value of the
measurement is maximal. Call this maximal value $P_{K_{\max},i}$. It is possible to apply a
threshold t, 0 < t < 1, to the assignment of individuals to clusters so that an individual
is only assigned to a cluster if $P_{K_{\max},i} > t$. If not, then the individual may be
removed from subsequent analyses. We investigated the effect of setting such a
threshold t. The main observation is that applying a threshold has very little effect
on the make-up and distribution of clusters across the UK, nor on downstream 
analyses (data not shown). For further discussion see Supplementary Note. 

One possible consequence of this extra procedure is to reduce the final number 
of clusters inferred from that of the so-called final inferred state. For analysis A,
the final number of UK clusters inferred, after the extra procedure, is 53 (the initial
final inferred state had 55). For analysis C the final number of European groups 
inferred is 145 (no change to the initial final inferred state). 

We assessed convergence of the fineSTRUCTURE MCMC runs in various 
ways. This included running two independent chains and comparing aspects of the
assignments of individuals to clusters, and the results of downstream analyses,
between the two chains. Reassuringly, given the size of the state space being explored,
these diagnostics confirmed mixing of the MCMC chains (Extended Data
Fig. 2). 

Using the final assignments, we used fineSTRUCTURE to construct a ‘tree’ in 
the default manner described in ref. 15 by successively merging pairs of clusters. 
Starting at the final cluster assignments, fineSTRUCTURE merged the pair of clus- 
ters whose merging gave the smallest decrease to the posterior probability among 
all possible pairwise merges. This gives the next level up in the tree (with one fewer 
cluster). We repeat this merging process at the new level and continue until just two 
clusters remain. Figure 1 shows the assignment of individuals to clusters for the 
level of the tree when 17 clusters remain. The final cluster assignments and the 
assignments of individuals to clusters at all levels of the tree are provided in Sup- 
plementary Figs 1.1-1.24 for the UK clustering analyses (A). The tree so obtained is 
a hierarchical clustering tree and should not be interpreted as a phylogeny. None- 
theless there is information about the strength of the differentiation between clus- 
ters in these trees. 

It is possible to use the vectors of measures $P_{K,i}$ defined above, of the confidence
associated with the assignment of individual i to each cluster k in the final inferred
state, to reassign individuals to clusters at any level of the tree. Consider the
following. Define the lowest or finest level of the tree, the level relating to the final
cluster assignments, to be $L_K$, where K is the number of clusters in the final inferred
state. Then define each level of the tree to be $L_J$, where J in (2, 3, ..., K) is the number
of clusters at the level of interest. For a given level of the tree $L_J$, each cluster $C_{J,j}$,
j in (1, ..., J), is made up of one or more clusters at the lowest level of the tree, merged
into a single cluster. For example, the large UK cluster in central and southern
England at the level containing 17 clusters (depicted in Fig. 1, red squares) is the
union of eleven smaller clusters from the final inferred state. For each individual i
it is possible to define a new J-vector of measures $P_{J,i}$ for level $L_J$, where for each
cluster $C_{J,j}$ we sum the values in $P_{K,i}$ for all clusters that are merged to form $C_{J,j}$, and
store the result in component j of $P_{J,i}$. Thus, for our previous example of the large
cluster in central and southern England at the level containing 17 clusters, for each
individual i we sum the values relating to the eleven constituent clusters at the final
inferred state that make up this larger cluster, and use this as the measure of
confidence that the individual i is assigned to the larger cluster. We can use the vector
of measures $P_{J,i}$ so defined to reassign individuals to the cluster for which $P_{J,i}$ is
maximal. This will potentially result in some individuals being reassigned to a
different cluster from the one to which they were assigned by the standard tree-building
method. For example, we see this has occurred for exactly one individual in
Extended Data Fig. 1, resulting in the different total numbers assigned to the red
square and purple cross clusters in Extended Data Fig. 1 when compared to
Supplementary Information Fig. 1.16 (both depicting 17 clusters). One advantage of this
process is that we can interpret $P_{J,i}$ as a measure of the confidence of the assignment
of an individual i to each cluster at the given level $L_J$. We can also set a threshold t
and examine which individuals have lower confidence assignments to their cluster,
where by ‘lower confidence’ we mean that the maximum value in the vector $P_{J,i}$ is
less than t. We depict this for the UK clustering at the level of 17 clusters in
Extended Data Fig. 1, when we set t = 0.7.
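
The aggregation of confidence values up the tree can be illustrated with a short sketch. It assumes the fine-level confidence vector for one individual is a list indexed by cluster, and that a list maps each fine cluster to the coarse cluster it was merged into. The function names, the mapping and the numbers are hypothetical; only the threshold t = 0.7 comes from the text.

```python
def aggregate_confidence(p_fine, fine_to_coarse, n_coarse):
    """Sum fine-level confidences over the fine clusters merged into each coarse cluster."""
    p_coarse = [0.0] * n_coarse
    for k, value in enumerate(p_fine):
        p_coarse[fine_to_coarse[k]] += value
    return p_coarse


def assign_with_threshold(p_coarse, t=0.7):
    """Return the most probable coarse cluster and a flag for lower-confidence assignments."""
    best = max(range(len(p_coarse)), key=lambda j: p_coarse[j])
    return best, p_coarse[best] < t


# Hypothetical individual: four fine clusters merged pairwise into two coarse clusters.
p_fine = [0.4, 0.25, 0.2, 0.15]
fine_to_coarse = [0, 0, 1, 1]

p_coarse = aggregate_confidence(p_fine, fine_to_coarse, n_coarse=2)
cluster, low_confidence = assign_with_threshold(p_coarse, t=0.7)
```

Here the individual is assigned to coarse cluster 0 with confidence of about 0.65, which falls below t = 0.7 and is therefore flagged as a lower-confidence assignment.
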

Other methods for detecting population structure. We implemented principal
components analysis (PCA) using the package MMM”. We applied PCA to the 
intersection of the SNPs used for PCA in the WTCCC2 project” and the SNPs 
passing quality control filters in the UK sample in this paper. This resulted in 188,329
SNPs with minor allele frequency >0.05 in the UK population. These SNPs are 
distributed approximately evenly with respect to the genetic distance across the 22 
autosomes. We excluded all SNPs in regions with unusually high loadings based 
on visual inspection of the first 20 axes of PCA applied to the UK control samples 
of WTCCC2. The results are shown in Extended Data Fig. 3a. 

We also applied the program ADMIXTURE" to these same data, using default
settings as recommended by the authors. The ADMIXTURE model effectively as- 
sumes independence of the markers used across the genome. We ran ADMIXTURE 
three times, corresponding to three different choices for the number of clusters to 
be used for classification (K). To understand the method in the simplest cases we 
set K = 2 and K = 3, and for comparison to the results presented in our main ana- 
lyses we set K = 17. The results are shown in Extended Data Fig. 3b. 

Continuous or discrete frameworks for modelling and inferring population
structure. There is a general issue when modelling genetic variation from spa- 
tially structured populations as to whether to use models which characterize the 
population as comprised of distinct subpopulations, or at the other extreme to model 
the population in continuous space, without distinct subgroups, where isolation by
distance is the primary factor in giving rise to geographical substructure’®**?””*. 
Both are obviously oversimplifications for natural populations, and in particular 
for humans, and are more naturally thought of as caricatures and as endpoints of a 
spectrum, with debate as to which might be closer to capturing the important fea- 
tures of historical human demography. 

One potential criticism of the fineSTRUCTURE approach is that it is embed- 
ded in a framework of discrete subgroups. There is an obvious sense in which 
fineSTRUCTURE is closer to this framework: it explicitly estimates a set of sub- 
groups in the population, on the basis of patterns of shared ancestry. Although 
this is a description of the population, rather than a model of it, it might well be 
more natural or useful if there is, in reality, some underlying discreteness. On the 
other hand, the hierarchical tree estimated by fineSTRUCTURE allows viewing of
the population at multiple levels of clustering. This does not stipulate a fixed num- 
ber of subgroups, and instead provides a complex description of the underlying 
structure—in effect zooming in from the coarsest partition of the population as 
two subgroups to examine finer and finer partitions. Taken together, we argue 
that this approach is better suited to capturing the complexities of real popula- 
tions than had it only described a single set of discrete subgroups. Our approach, 
of probabilistically classifying individuals into groups at a particular level, rather 
than forcing them to belong to exactly one cluster, also allows some flexibility in a 
world where there is smoother variation with geography. 

Clearly some, but not all, aspects of human demography will be influenced by 
the dynamics of isolation by distance. Conversely, cultural, linguistic, and geograph- 
ical barriers will all tend to encourage boundaries, and hence discreteness of sub- 
groups. We are encouraged by the fact that the multi-level descriptive framework 
of fineSTRUCTURE, as applied to the subtle levels of population structure within 
the UK, is clearly capturing real effects, as evidenced for example by the concordance 
with geography, largely non-overlapping clusters (compared with ADMIXTURE), 
confident assignment of individuals to clusters in most cases (typically except where 
clusters overlap geographically), and its ability to detect groups which reflect known 
historical events. 

Estimating ancestry profiles. To understand the genetic make-up of different ge- 
netic clusters in the UK with respect to potential ancestral populations we performed 
the following analyses. For analysis B (above) the CHROMOPAINTER algorithm 
provides estimates of the proportion of each UK individual's DNA that is most closely 
related ancestrally to each European individual, among all the sample members. 
These proportions can then be summed across groups. These proportions approx- 
imate the fraction of an individual’s DNA that coalesces, back in time, most re- 
cently with each particular sampled individual'*. Because in humans these coales- 
cence events can be far back in time relative to population separation times, we 
expect them to often predate population splits (that is, we expect incomplete lin- 
eage sorting). This leads to differences in the amount of DNA copied from differ- 
ent European groups being subtle, in a sense adding noise. The amount of noise 
depends on the number of individuals sampled—and thus potentially sharing DNA— 
in the different groups, with larger sample sizes likely to reduce noise. In addition, 
we rely on informative variation patterns to identify individuals from whom DNA is 
copied, adding additional noise, which may systematically vary across the genome. 
To account for this noise we follow ref. 22: at each level of the hierarchical
clustering tree of the UK samples, and for a fixed level (see main text) of the European
samples’ hierarchical clustering tree, giving the set of G (= 51) groups inferred for
Europe, we perform the following multiple linear regression. Let $Y_P$ be a G-vector
describing the average proportion of DNA genome-wide that a cluster P of UK individuals
copies from each of G groups of European individuals, as inferred by CHROMOPAINTER.
That is, element g of $Y_P$ consists of CHROMOPAINTER’s total genome-wide length (in cM)
of all haplotype segments inferred to be most closely related ancestrally to any
individual of European group g, normalized to sum to unity across all g in 1,...,G
within a UK individual and then averaged across all individuals in the UK cluster P.
We use copying lengths,
rather than counts (used in the clustering itself), for this analysis because all indivi- 
duals have the same total genetic length, but this length may be broken into differing 
numbers of copying segments in different individuals. Thus it is straightforward to 
interpret coefficients in the below linear regression, in terms of the fraction of the ge- 
nome contributed by different components in the mixture, using copying lengths, 
but interpretation would be more difficult using counts of shared DNA segments. 
Analogously, let $X_g$ be a G-vector describing the average proportion of DNA that the
European individuals of group g copy from each of the G European groups as inferred 
by CHROMOPAINTER, including their own group (though note individuals are not 
allowed to copy from their own haplotypes in CHROMOPAINTER). We assume

$$Y_P = \beta_1 X_1 + \beta_2 X_2 + \dots + \beta_G X_G, \qquad (1)$$

and solve simultaneously for the $\beta_g$ under the restriction that each $\beta_g \ge 0$ and
$\sum_{g=1}^{G} \beta_g = 1$, using a slight adaptation of the non-negative-least-squares (nnls)
function in the statistical software package R (see ref. 29).

We interpret the inferred value for $\beta_g$ as the average proportion of genome-wide
DNA of a UK individual from cluster P that is most closely related ancestrally
to European group g. We refer to these vectors as ‘ancestry profiles’.
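
To make the constrained regression in (1) concrete, the sketch below fits mixture coefficients that are non-negative and sum to one. It does not reproduce the adapted R nnls routine used in the paper, whose details are not given here. Instead it swaps in a different, simple technique: projected gradient descent onto the probability simplex. The function names, the step-size rule, the iteration count and the toy vectors are all assumptions made for illustration.

```python
def project_to_simplex(v):
    """Euclidean projection of v onto {b : b_g >= 0, sum_g b_g = 1}."""
    u = sorted(v, reverse=True)
    cumulative, theta = 0.0, 0.0
    for j, u_j in enumerate(u, start=1):
        cumulative += u_j
        candidate = (cumulative - 1.0) / j
        if u_j - candidate > 0:
            theta = candidate  # keep the threshold for the largest valid j
    return [max(x - theta, 0.0) for x in v]


def fit_mixture(y, sources, n_steps=5000):
    """Minimise ||y - sum_g beta_g * sources[g]||^2 subject to the simplex constraint."""
    g = len(sources)
    beta = [1.0 / g] * g
    # 2 * trace(X X^T) is an upper bound on the gradient's Lipschitz constant.
    step = 1.0 / (2.0 * sum(x * x for row in sources for x in row))
    for _ in range(n_steps):
        fitted = [sum(beta[k] * sources[k][d] for k in range(g)) for d in range(len(y))]
        residual = [f - t for f, t in zip(fitted, y)]
        grad = [2.0 * sum(r * x for r, x in zip(residual, sources[k])) for k in range(g)]
        beta = project_to_simplex([b - step * gr for b, gr in zip(beta, grad)])
    return beta


# Toy example with three hypothetical source groups and known mixing proportions.
X = [[0.7, 0.2, 0.1], [0.1, 0.8, 0.1], [0.2, 0.2, 0.6]]
true_beta = [0.5, 0.3, 0.2]
Y = [sum(true_beta[k] * X[k][d] for k in range(3)) for d in range(3)]

estimated_beta = fit_mixture(Y, X)
```

In this noise-free toy case the estimate should recover the mixing proportions used to build Y; with real copying vectors the fit is approximate.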

To assess statistical uncertainty in our estimates of the $\beta_g$ for each UK cluster P,
we perform a bootstrap procedure where we re-sample the chromosomes of the
$N_P$ UK individuals in this group (constructing pseudo-individuals by sampling
pairs of chromosomes for each of the autosomes). In particular, for each bootstrap
iteration, we randomly sample the G-vector of CHROMOPAINTER output
across these UK individuals $N_P$ times with replacement for each chromosome
1-22. We then generate each of $N_P$ ‘pseudo-individuals’ by randomly summing
22 pairs of these samples (without replacement), one pair per chromosome, and
then summing across the first, respectively the second, member of each pair before
rescaling the resulting G-vectors to sum to unity. Averaging each element of the
G-vectors across these $N_P$ pseudo-individuals gives us a new re-sampled value of
$Y_P$, which we then substitute into (1) above to generate new inferred values of the
$\beta_g$. We repeat this procedure 1,000 times, reporting the inner 95% quantiles of the
sampling distribution for a given European group g across these 1,000 bootstrap
re-samples (see Supplementary Table 4 and Extended Data Fig. 6a).
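
A simplified sketch of this chromosome-level bootstrap is given below. It is not the authors' exact procedure: it resamples whole-individual vectors per chromosome rather than pairs of haplotypes, and it returns the re-sampled cluster averages without re-fitting the regression in (1) for each replicate. The data layout, the function names, the quantile indexing convention and the toy values are all assumptions.

```python
import random


def bootstrap_cluster_mean(lengths, n_boot=1000, seed=0):
    """lengths[i][c] is the G-vector (cM copied per group) for individual i, chromosome c."""
    rng = random.Random(seed)
    n_ind, n_chrom, n_grp = len(lengths), len(lengths[0]), len(lengths[0][0])
    replicates = []
    for _ in range(n_boot):
        mean = [0.0] * n_grp
        for _ in range(n_ind):  # build one pseudo-individual
            pseudo = [0.0] * n_grp
            for c in range(n_chrom):
                donor = rng.randrange(n_ind)  # resample this chromosome with replacement
                for g in range(n_grp):
                    pseudo[g] += lengths[donor][c][g]
            total = sum(pseudo)  # rescale so the pseudo-individual sums to one
            for g in range(n_grp):
                mean[g] += pseudo[g] / total / n_ind
        replicates.append(mean)
    return replicates


def percentile_interval(values, lower=0.025, upper=0.975):
    """Inner quantiles using a simple floor-index convention (an assumption)."""
    ordered = sorted(values)
    lo = ordered[int(lower * (len(ordered) - 1))]
    hi = ordered[int(upper * (len(ordered) - 1))]
    return lo, hi


# Toy data: 3 individuals, 2 chromosomes, 2 donor groups (hypothetical values).
toy_lengths = [
    [[3.0, 1.0], [2.0, 2.0]],
    [[1.0, 3.0], [2.0, 2.0]],
    [[2.0, 2.0], [3.0, 1.0]],
]
replicates = bootstrap_cluster_mean(toy_lengths, n_boot=50, seed=1)
interval = percentile_interval([r[0] for r in replicates])
```

In a fuller version each replicate would be passed to the constrained regression, and the interval taken over the resulting coefficients for each European group.
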

Assessing the strength and robustness of the inferred population structure:
$F_{ST}$, identity by descent (IBD) and total variation distance (TVD). Using the same
set of SNPs that were used for the PCA analyses (see above) we analysed pairwise
$F_{ST}$ both between the sample collection districts, and between the 17 inferred clusters
from our main analysis using the method implemented in the program Eigensoft*®.
The complete matrices of pairwise $F_{ST}$ values are given in Supplementary Tables 1
and 2. 

To investigate the effect that recent shared ancestry may have on our analyses 
we calculated a measure of pairwise IBD and compared its distribution within 
clusters to its distribution across the whole sample. This measure uses a hidden 
Markov model (HMM) to estimate IBD across the genome". The measure is likely 
to be useful when the shared relatedness is just a few generations in the past, allow- 
ing the identification of pairs of individuals in our UK sample that are reasonably 
closely related. The results are plotted in Extended Data Fig. 4. Reassuringly, these 
confirm that levels of relatedness within clusters are typically similar to those between 
clusters, and hence that our observed clusters are not an artefact of a sampling scheme 
which preferentially selected closely related individuals from regional localities. 

To quantify the strength of differences between the inferred clusters we perform 
the following analyses. As noted above we can summarize the copying profiles of all 
the samples in a given cluster X to produce a characteristic ‘copying vector’
$x = (x_1, x_2, \dots, x_n)$: the average (across individuals in cluster X) proportion of each
individual in cluster X’s closest ancestry that is attributed to individuals from each of
the clusters, $Y = (Y_1, Y_2, \dots, Y_n)$, where n is the number of inferred clusters. In fact,
this copying vector can be calculated for any group of samples (that is, not only 
the inferred clusters). One can use these vectors to test if the clusters inferred by 
fineSTRUCTURE are capturing significant differences in ancestry, and to give a 
sense of the strength of the differences observed. Given a pair of inferred clusters (A 
and B) and their copying vectors (a and b respectively) one can calculate the total 
variation distance ($TVD_{CV}$) between the pair:

$$TVD_{CV}(A,B) = 0.5 \times \sum_{i=1}^{n} |a_i - b_i|.$$

$TVD_{CV}$ can be interpreted as a measure of the difference between the two clusters.
(As the copying vectors are discrete probability distributions over the set of clus- 
ters, total variation distance is a natural metric for quantifying the difference 
between them.) 

Furthermore, given a pair of clusters (A and B) one can randomly reassign the 
individuals in the clusters, maintaining the cluster sizes, to obtain a new pair of 
clusters (A’ and B’, of the same size as A and B, respectively). One can then cal- 
culate the copying vectors (a’ and b’) for the new clusters A’ and B’, and the total 
variation distance between them. Repeating this process m times one can obtain a 
P value from a permutation test of the null hypothesis that, given the cluster sizes, 
the individuals in the two clusters are assigned randomly to each cluster. Here the 
P value is the proportion of the m permutations where $TVD_{CV}(A',B') > TVD_{CV}(A,B)$.
Supplementary Table 3 shows the value of the $TVD_{CV}$ statistic for all pairs of the
17 clusters used in our main analyses. 
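
The total variation distance and the permutation test can be sketched as below. This is a simplified illustration: each individual's copying vector is held fixed while labels are permuted, whereas in the full analysis the donor clusters themselves could change under permutation. The strict inequality follows the text as extracted. The function names and the toy vectors are hypothetical.

```python
import random


def mean_vector(vectors):
    """Element-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(v[i] for v in vectors) / n for i in range(len(vectors[0]))]


def tvd(a, b):
    """Total variation distance between two discrete probability vectors."""
    return 0.5 * sum(abs(x - y) for x, y in zip(a, b))


def permutation_p_value(cluster_a, cluster_b, m=1000, seed=0):
    """Permute individuals between two clusters, keeping the cluster sizes fixed."""
    rng = random.Random(seed)
    observed = tvd(mean_vector(cluster_a), mean_vector(cluster_b))
    pooled = list(cluster_a) + list(cluster_b)
    exceed = 0
    for _ in range(m):
        rng.shuffle(pooled)
        a_perm = pooled[:len(cluster_a)]
        b_perm = pooled[len(cluster_a):]
        if tvd(mean_vector(a_perm), mean_vector(b_perm)) > observed:
            exceed += 1
    return observed, exceed / m


# Toy example: two clearly separated clusters of five individuals each.
cluster_a = [[0.9, 0.1] for _ in range(5)]
cluster_b = [[0.1, 0.9] for _ in range(5)]
observed_tvd, p_value = permutation_p_value(cluster_a, cluster_b, m=200, seed=0)
```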

Similarly, rather than using the copying vectors for a pair of clusters (A and B),
one can use the ancestry profiles of the clusters ($\alpha$ and $\beta$) to calculate the total
variation distance between the ancestry profiles of a pair of clusters ($TVD_{AP}$):

$$TVD_{AP}(A,B) = 0.5 \times \sum_{i=1}^{G} |\alpha_i - \beta_i|.$$

$TVD_{AP}$ can be interpreted as a measure of the difference between the ancestry
profiles of the two clusters. (Again, as ancestry profiles are discrete probability 
distributions, total variation distance is a natural metric for quantifying the dif- 
ference between them.) 

As above, one can permute the individuals that are assigned to each cluster,
maintaining the cluster sizes, and calculate the ancestry profiles of the resulting
clusters (A’ and B’) and the total variation distance between them. As before, repeating
this process m times one can obtain a P value from a permutation test of the
null hypothesis that, given the cluster sizes, the individuals in the two clusters are
assigned randomly to each cluster with respect to their ancestry profile. Here the
P value is the proportion of the m permutations where $TVD_{AP}(A',B') > TVD_{AP}(A,B)$.
Supplementary Table 5 gives $TVD_{AP}$ for all pairs of ancestry profiles for the 17 UK
clusters used in our main analyses, and gives the associated P values based on 1,000 
permutations. 

Assessing the accuracy and robustness of the ancestry profiles. We undertook a
number of simulation studies, generating data with similar properties to the actual 
data, to assess the accuracy of the estimated ancestry profiles. These suggested good 
accuracy of the major components of our estimated ancestry profiles. 

A major challenge for this kind of simulation study is in simulating data which 
has similar properties to the real data. The subtle similarities and differences within 
and between our various UK clusters and European groups are generated by their 
complicated shared and distinct demographic histories. This true demographic 
history is unknown and might not be well approximated by simple models that can 
be simulated from, and so it is not possible to simulate realistic data from the appro- 
priate model. Instead, we used subsamples of the real data for our simulation 
studies. This has the advantage that it replicates patterns in the real data, but the 
disadvantage that simulation studies must be based on smaller sample sizes than 
the actual study. (Some of the data are needed to simulate the scenario of
interest, and the rest to analyse that scenario, so neither the simulated
data set nor the data used for analysis can be as large as the actual data set.)

For each simulation scenario described below, we generated N simulated individuals
as mixtures of two populations A and B intermixing $\lambda$ generations ago in
proportions $\alpha$, $\beta$ (= 1 − $\alpha$) respectively, closely following established approaches***”’.
Informally, to simulate an admixed haploid chromosome we did the following: a
genetic distance x (in centimorgans) was sampled from an exponential distribution
with rate $\lambda$/100. The first x cM of the simulated chromosome was composed of the
first x cM of a real data chromosome selected randomly from either population A
or B according to the proportions of admixture $\alpha$ and $\beta$ (the specific values used are
given below). Then a new genetic distance was sampled from the same exponential
distribution (rate = $\lambda$/100), and the process repeated until an entire simulated
chromosome was generated. This was repeated for all 22 autosomes, resulting
in a single (haploid) set of chromosomes for one individual. We did this 2N times,
generating 2N full sets of haploid autosomes. (To limit the chance of multiple
simulated individuals copying from the same real data individual at any location in the
genome, wherever possible the new piece of chromosome sampled was selected
from the pool of chromosomes in the selected population (A or B) for which no
other previously simulated chromosome had copied at the same location. When
this was not possible, a chromosome was selected at random from the selected
population (A or B); see ref. 22.) Diploid individuals were constructed by aggregating
two full sets of haploid chromosomes, making N simulated individuals in total.
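
The segment-sampling scheme can be sketched as follows, assuming the exponential rate is $\lambda$/100 per cM for admixture $\lambda$ generations ago and that population B is chosen with probability $\beta$. The sketch records only which population and which donor index each segment is copied from; it omits the genotype copying and the refinement that avoids reusing the same donor at the same location. The function name, the chromosome length and the donor counts in the example are hypothetical.

```python
import random


def simulate_admixed_chromosome(length_cm, lam, beta, n_a, n_b, rng):
    """Return (start_cM, end_cM, population, donor_index) segments tiling one chromosome."""
    segments = []
    position = 0.0
    while position < length_cm:
        # Segment length in cM; the mean is 100 / lam.
        step = rng.expovariate(lam / 100.0)
        end = min(position + step, length_cm)
        if rng.random() < beta:
            population, donor = "B", rng.randrange(n_b)
        else:
            population, donor = "A", rng.randrange(n_a)
        segments.append((position, end, population, donor))
        position = end
    return segments


# Example: a 150 cM chromosome, admixture 40 generations ago, 25% from population B.
rng = random.Random(1)
segments = simulate_admixed_chromosome(150.0, lam=40, beta=0.25, n_a=30, n_b=10, rng=rng)
```

Repeating this for each autosome and pairing two haploid sets would give one simulated diploid individual.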

We considered three scenarios of two populations admixing, and for each of
these scenarios we considered three proportions of admixture for the second group
($\beta$ = 0.1, 0.25 and 0.5). This yielded the following nine sets of simulations:

(1) ‘Italy and northern Germany’: N = 25, $\lambda$ = 40, $\beta$ = 0.1, 0.25 and 0.5, derived
by mixing 30 randomly sampled individuals from the Italian Group ITA36 (which
contains 284 individuals) with 10 randomly sampled individuals from GER3 (58
individuals).

(2) ‘North Wales and Norway’: N = 40, $\lambda$ = 29, $\beta$ = 0.1, 0.25 and 0.5, derived by
mixing 75 individuals from the N Wales cluster with 10 randomly sampled
individuals from NOR72 (116 individuals) and 10 from NOR71 (148 individuals).

(3) ‘North Wales and Denmark’: N = 25, $\lambda$ = 40, $\beta$ = 0.1, 0.25 and 0.5, derived by
mixing 75 individuals from the N Wales cluster with 20 randomly sampled
individuals from DEN18 (319 individuals).

These simulations were chosen both to test our model’s ability to infer sources
of admixture and their proportions from distinct European groups (simulation 1),
and to mimic admixture events we infer in our main analyses, that is, relating
to the Norwegian Viking (simulation 2) and Anglo-Saxon (simulation 3) migrations
into the UK. Simulations (2) and (3) use samples from the N Wales cluster,
which we infer has little evidence of DNA influx from the Norwegian Vikings and
Anglo-Saxons, and mix them with groups containing primarily individuals sampled
from Norway (2) or from Denmark (3). These simulations are used to model
admixture between the ‘ancient’ British population (that is, genetically constituted 
as it was before the Saxon invasion) and Norwegian Viking or Anglo-Saxon settlers, 
respectively. Simulation (2) further assesses our model’s ability to distinguish two 
distinct Norwegian sources of admixture from among 12 different groups prim- 
arily containing samples from Norway. 

For each simulated data set we estimated ancestry profiles as follows. We used 
CHROMOPAINTER (see above) to represent each of the 2N simulated haplotypes 
as a mosaic of all the European haplotypes except those used for the relevant 
simulations. Specifically, the 40 samples from ITA36 and GER3 used for the sim- 
ulations in the ‘Italy and Northern Germany’ scenario (1) were removed from the 
CHROMOPAINTER analysis for scenario (1). Similarly the European samples 
used for the simulations in (2) and (3) were removed in their respective 
CHROMOPAINTER analyses. This ensures that the actual admixing individuals are not 
sampled when forming the mosaics. We used the estimated switch and emission rates 
from the main analysis, described in ‘Inference of population structure’ above. 

Recall the ancestry profiles are determined by fitting a linear mixture model 
that utilizes both the CHROMOPAINTER copying profiles derived from making 
up the ‘target group’ (here the simulated samples, in our main analysis the UK 
samples within a cluster) haplotypes from the ‘source groups’ (here the European 
samples except those used in the simulations, in our main analysis all the Euro- 
pean samples) haplotypes, as well as the CHROMOPAINTER copying profiles 
used for the clustering of the ‘source groups’. To obtain the latter we adapted the 
results from the existing CHROMOPAINTER analysis C (see ‘Inference of popu- 
lation structure’ above) as follows. (It would be computationally prohibitive to 
rerun the full CHROMOPAINTER analysis for each of the nine simulated data 
sets.) For each European individual’s copying profile the elements associated with 
the European samples used in the simulations were removed. Then, for each of 
the 51 European groups, we averaged these adjusted copying profiles across all 
individuals assigned to the given group (excluding any individuals used in the rele- 
vant simulations) as described in ‘Inference of population structure’, and used the 
adjusted copying profiles for the 51 EU groups as covariates in our linear mixture 
model as described in estimating ancestry profiles. 

This post-hoc adjustment of the copying profiles for each of the 51 European 
groups assumes that if we had repeated the CHROMOPAINTER analysis for the 
relevant reduced set of European samples, the copying profile of the parts of the 
chromosomes previously associated with the removed samples is redistributed 
evenly across all the other European individuals. This is inherently conservative, 
as it is more likely that excluding, for example, 10 of the 58 GER3 samples from 
the ‘new’ GER3 group would have resulted in an increase of copying from the 
other 48 GER3 samples, relative to the increase in copying from individuals from 
other European groups. Thus the performance of our approach for determining 
ancestry profiles in our simulation study is likely to be an under-representation of 
the performance of our approach in the main data analyses. 

Furthermore, we only used a relatively small number of individuals from each 
of ITA36, GER3, NOR71, NOR72 and DEN18 in the simulations, to ensure a suf- 
ficient number of remaining individuals from each to use for inferring the ances- 
try profiles. As a consequence, the number of simulated individuals we generated 
is rather small, consisting of only 25 or 40 individuals per simulation, compared 
to our main analysis (using the real data) where many of the clusters were signifi- 
cantly larger. We expect the increased sample size for the majority of clusters used 
in our main analyses to improve our inference of ancestry profiles relative to the 
simulations, substantially so in some cases such as Cent./S England which contains 
1,044 individuals. 

We also adopted an alternative simulation approach for the scenarios repre- 
sented by (1)-(3) above using a forwards-in-time simulation method, initialised 
from real data, as previously described”. In each case, we combined a subset of the 
same randomly sampled individuals from populations A and B above (for example, 
for (1), the 30 individuals from ITA36 and the 10 individuals from GER3) into a 
single pool population, which we then simulated forwards in time for the same λ 
generations as used above. To imitate the three simulations for scenario (1) above 
this pool population contained respectively (20, 60, 60) haplotypes from ITA36 
and (20, 20, 7) haplotypes from GER3 to approximate admixture contributions of 
(0.5, 0.25, 0.1) from GER3. Similarly for scenario (2) the pool population contained 
(40, 120, 150) haplotypes from N Wales and (40, 40, 18) haplotypes from NOR72/ 
NOR71 (half from each); and for scenario (3) (50, 150, 150) haplotypes from N 
Wales and (50, 50, 18) haplotypes from DEN18. 
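
As a quick arithmetic check of the pool compositions quoted above, the following minimal sketch computes the share of each pool contributed by the second source. The haplotype counts are copied from the text; the dictionary keys and the function name are illustrative labels, not names used by the authors.

```python
# Haplotype counts per pool, copied from the text as (first source, second source),
# listed in the order of the target proportions 0.5, 0.25 and 0.1.
POOLS = {
    "Italy / northern Germany": [(20, 20), (60, 20), (60, 7)],
    "North Wales / Norway": [(40, 40), (120, 40), (150, 18)],
    "North Wales / Denmark": [(50, 50), (150, 50), (150, 18)],
}


def second_source_fraction(n_first, n_second):
    """Fraction of the pooled haplotypes that come from the second source."""
    return n_second / (n_first + n_second)


fractions = {
    name: [round(second_source_fraction(a, b), 3) for a, b in pools]
    for name, pools in POOLS.items()
}

for name, values in fractions.items():
    print(name, values)
```

The smallest pools give roughly 0.10 to 0.11 rather than exactly 0.1, which is consistent with the text describing these contributions as approximate.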

To create the next generation of haplotypes following this admixture event, we 
randomly sampled two distinct parental haplotypes (each comprising a full set of 
22 single chromosomes from one individual) from the pool. We composed a new 
set of haplotypes for an individual in the next generation as a mosaic of chunks from 
these two parent sets, with switches in the mosaic based on the HapMap Phase 2 
genetic map (June 2008, build 36 genetic map, as above). More specifically, we 
determined the number of recombination breakpoints on each chromosome by 
summing a random sample from a Bernoulli distribution with probability 0.5 (which 
models the expected obligate crossover per generation per chromosome) and a 
random sample from a Poisson distribution with rate equal to the total genetic length 
of the chromosome in Morgans minus 0.5 (which models the remaining crossovers). 
We then sampled the physical location of each of the breakpoints independently 
according to their relative genetic map value, copying segments on either side of a 
breakpoint without mutation from the chromosome’s two different parents. In the 
first generation after the admixture, we repeated this process to generate 500 full 
(that is, chromosomes 1-22) sets of haplotypes. For the remaining λ − 1 generations, 
500 new full sets of haplotypes were each simulated in the same manner as a 
mosaic of chunks from two distinct full sets of haplotypes randomly sampled with 
replacement from the previous generation. After λ generations, we randomly sampled 
distinct haplotypes (that is, without replacement) to form N individuals for 
subsequent analysis, where N is the same as in the relevant scenario of (1)-(3) above. 
We then inferred ancestry proportions in these N simulated individuals in the 
same manner described above. 
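
The breakpoint sampling for a single chromosome can be sketched as below. This is a minimal illustration under stated assumptions, not the authors' code: the chromosome is represented as a list of alleles at the markers of a genetic map, `cumulative_cm` is a hypothetical list of cumulative map positions in centimorgans, the Poisson draw uses Knuth's method because the Python standard library has no Poisson sampler, and the choice of which parent the mosaic starts from is made at random (the text does not specify this).

```python
import bisect
import math
import random


def sample_poisson(rate, rng):
    """Draw one Poisson(rate) variate with Knuth's multiplication method."""
    if rate <= 0.0:
        return 0
    threshold = math.exp(-rate)
    count, product = 0, rng.random()
    while product > threshold:
        count += 1
        product *= rng.random()
    return count


def sample_breakpoint_count(length_morgans, rng):
    """Bernoulli(0.5) draw plus a Poisson(length - 0.5) draw, as in the text."""
    obligate = 1 if rng.random() < 0.5 else 0
    return obligate + sample_poisson(max(length_morgans - 0.5, 0.0), rng)


def recombine(parent_one, parent_two, cumulative_cm, rng):
    """Build one child chromosome as a mosaic of two parental chromosomes.

    parent_one, parent_two: equal-length allele sequences at the map markers.
    cumulative_cm: ascending cumulative genetic position (cM) of each marker.
    """
    length_morgans = (cumulative_cm[-1] - cumulative_cm[0]) / 100.0
    n_breaks = sample_breakpoint_count(length_morgans, rng)
    # Uniform on the genetic map, so physical placement follows map distance.
    targets = sorted(
        rng.uniform(cumulative_cm[0], cumulative_cm[-1]) for _ in range(n_breaks)
    )
    break_indices = [bisect.bisect_left(cumulative_cm, t) for t in targets]
    parents = (parent_one, parent_two)
    current = rng.randrange(2)  # assumption: starting parent chosen at random
    child, start = [], 0
    for index in break_indices + [len(cumulative_cm)]:
        child.extend(parents[current][start:index])  # copied without mutation
        start = index
        current = 1 - current  # switch parent at each breakpoint
    return child
```

Repeating `recombine` for each of the 22 chromosomes, with the two parental sets drawn from the previous generation, would give one full set of haplotypes for the next generation.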

The resulting ancestry profiles from all 18 simulation studies (2 simulation methods 
times 3 scenarios times 3 admixture proportions) are given in Supplementary 
Table 6. 
Dating admixture events in Orkney and southeast England. We ran GLOBE- 
TROTTER” to estimate the time of the major admixture events contributing to the 
make up of the Cent./S England cluster and the three clusters in Orkney (Westray, 
Orkney 1 and Orkney 2) using the 51 European groups as surrogates for the puta- 
tive admixing ‘source groups’ (that is, using analysis B from ‘Inference of popula- 
tion structure’ above) and assuming a single ‘pulse’ of admixture when analysing 
each UK cluster. We closely follow the application of GLOBETROTTER as described 
by the authors. In short, CHROMOPAINTER identifies the segments of DNA within 
each UK individual’s genome that are most closely related ancestrally to each Euro- 
pean group, as described in ‘Inference of population structure’. GLOBETROTTER 
measures the decay of association versus genetic distance between the segments 
copied from a given pair of European groups. Assuming a single pulse of admixture 
between two or more distinct admixing source groups, theoretical considerations 
predict that this decay will be exponentially distributed with rate equal to the time 
(in generations ago) that this admixture occurred”. GLOBETROTTER jointly fits 
an exponential distribution to the decay curves for all pairwise combinations of 
European groups and determines the single best fitting rate, hence determining 
the most likely single admixture event and estimating the date it occurred. Instead 
of requiring specific genetic surrogates to represent each admixing source group 
involved in the admixture, as in other dating approaches such as ROLLOFF”, 
GLOBETROTTER aims to infer the haplotype composition of each source group 
for the admixture as a linear combination of those carried by sampled groups (that 
is, a linear combination of the 51 European groups). This results in the admixed 
group themselves automatically being represented in the same form—as a mixture 
of mixtures—consistent with the linear estimation procedure we applied for each 
UK group, before estimating admixture dates for each group. 

The following provides more details on our approach for dating and estimating 
admixture proportions within a single UK cluster; full details of the GLOBETROTTER 
method are provided in ref. 22. For each haploid set of chromosomes of each indi- 
vidual from a given UK cluster, we consider the genome-wide mosaic inferred by 
CHROMOPAINTER in the UK on Europe analysis (analysis B from ‘Inference of 
population structure’ above). In this manner each UK sample’s haploid genome is 
pieced together as a series of ‘chunks’, with each chunk a contiguous segment of 
DNA best matching a European sample inferred to be most closely related ances- 
trally to that segment. We note that CHROMOPAINTER infers these mosaics for 
each individual many times in a probabilistic manner, so we can sample from the 
set of mosaics for a given individual. We sampled 10 such mosaics for each haploid 
genome of each UK individual in the cluster we are focusing on, giving 20 total 
mosaics for each UK individual. 

Consider two of these 20 mosaics (these two could be the same sampled mosaic). 
We compare each chunk on mosaic 1 to each chunk on mosaic 2. For each pair of 
chunks, we record the two European groups (perhaps the same) copied at each chunk 
(or more precisely, the group of the European individual inferred to be closest to 
the UK individual’s chunk) in the pair and the genetic distance between the two 
chunks’ midpoints. We remove any chunk pairs where this genetic distance is less 
than 1 cM (to avoid the effects of within population linkage disequilibrium con- 
founding signals of admixture) or greater than 50 cM (as linkage disequilibrium 
attributable to admixture will have decayed to zero by this distance). Otherwise we 
round this genetic distance to the nearest 0.1 cM and assign the chunk pair a score 
Scp equal to the product of the two chunks’ sizes in centimorgans, with chunk sizes 
larger than 1 cM fixed to 1 cM. This scoring protocol weights chunks’ contribu- 
tions by their relative size, so that larger chunks contribute more to the score, but 
caps the contribution of any chunk to prevent inference from being dominated by 
a small number of chunks. We repeat this for all $\binom{20}{2} = 190$ 
combinations of mosaics. After doing so, for each pair of European groups, say A 
and B, and for each 0.1 cM bin d in [1, 1.1, ..., 50 cM] we sum the Scp values across 
all chunk pairs where (i) the genetic distance between the two chunks’ midpoints is 
in d and (ii) one chunk in the pair copies A and the other copies B. We refer to this 
as the ‘coancestry vector’ for pair (A, B), which contains one element for each d. 
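
The tabulation for one pair of mosaics can be sketched as follows. This is a simplified illustration rather than the GLOBETROTTER implementation: each mosaic is assumed to be a list of hypothetical `(group, start_cM, end_cM)` chunks on a single chromosome, and pairs of groups are stored in sorted order so that (A, B) and (B, A) share one vector.

```python
from collections import defaultdict

MIN_CM = 1.0   # drop pairs closer than this (within-population LD)
MAX_CM = 50.0  # drop pairs further apart than this
CAP_CM = 1.0   # chunk sizes above this are fixed to this value


def coancestry_vectors(mosaic_one, mosaic_two):
    """Sum capped chunk-size products per group pair and 0.1 cM distance bin.

    Returns {(group_a, group_b): {binned_distance_cM: summed_score}}.
    """
    vectors = defaultdict(lambda: defaultdict(float))
    for group_one, start_one, end_one in mosaic_one:
        for group_two, start_two, end_two in mosaic_two:
            mid_one = (start_one + end_one) / 2.0
            mid_two = (start_two + end_two) / 2.0
            distance = abs(mid_one - mid_two)
            if distance < MIN_CM or distance > MAX_CM:
                continue
            score = min(end_one - start_one, CAP_CM) * min(end_two - start_two, CAP_CM)
            pair = tuple(sorted((group_one, group_two)))
            vectors[pair][round(distance, 1)] += score  # nearest 0.1 cM
    return vectors


example = coancestry_vectors(
    [("GER3", 0.0, 2.0), ("FRA17", 2.0, 2.6)],
    [("GER3", 0.0, 2.0), ("DEN18", 5.0, 5.6)],
)
```

In the example, the two GER3 chunks have identical midpoints, so that pair is discarded by the 1 cM lower bound, and the 2 cM chunk contributes as if it were 1 cM long.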

We repeat this tabulation for all pairs (A, B) of the 51 European groups, giving 
51 × 51 such coancestry vectors. After a re-scaling and then a re-weighting of these 
coancestry vectors using the inferred ancestry profiles from ‘Estimating ancestry 
profiles’ above (that is, the estimated ancestry coefficients), this gives a set of 
reweighted coancestry vectors (referred to as ‘observed coancestry curves’ in ref. 22) 
that efficiently capture the decay of linkage disequilibrium attributable to admixture 
(see ref. 22 for details). In the course of this re-weighting, we remove European groups 
whose inferred ancestry contribution to the given UK cluster is less than 0.1%, thus reducing the 
number of European groups remaining for consideration in our analysis. For each 
pair of European groups (M, N), now a subset of the 51 × 51 total pairwise combinations, 
we label the reweighted coancestry vector $v_{MN}$. We fit a coancestry curve, 
$\Psi_{MN}(d)$, to the values in $v_{MN}$ as follows: for each fixed pair (M, N) of 
European groups we fit the parametric model 

$$v_{MN}(d) = \Psi_{MN}(d) + \epsilon,$$

where $\epsilon > 0$ is an error term and 

$$\Psi_{MN}(d) = \tau_{MN} + \delta_{MN} \times e^{-\lambda d}.$$

Here $\lambda$ is interpreted as the date of admixture in generations from present. 
GLOBETROTTER jointly estimates the values of $\tau_{MN}$, $\delta_{MN}$ and $\lambda$ that minimize 
the sum of the mean squared error across the curves, that is, that minimize 

$$\sum_{(M,N)} \sum_{d} \left( v_{MN}(d) - \Psi_{MN}(d) \right)^{2}.$$
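
One simple way to carry out a joint fit of this form is sketched below. It is an illustration only and makes no claim about GLOBETROTTER's actual optimizer: for each candidate rate on a grid, the intercept and amplitude of every curve have a closed-form least-squares solution, and the rate with the smallest total squared error is kept. The sketch assumes distances are given in centimorgans and converts them to Morgans so that the rate is in generations; the function names and the grid of candidate rates are hypothetical.

```python
import math


def fit_curve_given_rate(distances_cm, values, rate):
    """Least-squares intercept and amplitude for one curve at a fixed rate.

    Model: value = intercept + amplitude * exp(-rate * distance_in_Morgans).
    Returns (intercept, amplitude, sum_of_squared_errors). Requires rate > 0.
    """
    x = [math.exp(-rate * d / 100.0) for d in distances_cm]
    n = len(x)
    mean_x = sum(x) / n
    mean_v = sum(values) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    sxv = sum((xi - mean_x) * (vi - mean_v) for xi, vi in zip(x, values))
    amplitude = sxv / sxx
    intercept = mean_v - amplitude * mean_x
    sse = sum((vi - intercept - amplitude * xi) ** 2 for xi, vi in zip(x, values))
    return intercept, amplitude, sse


def fit_joint_rate(distances_cm, curves, candidate_rates):
    """Pick the single rate minimising the total squared error over all curves.

    curves: {(group_m, group_n): list of values, one per distance bin}.
    Returns (best_rate, total_sse, {pair: (intercept, amplitude, sse)}).
    """
    best = None
    for rate in candidate_rates:
        fits = {
            pair: fit_curve_given_rate(distances_cm, values, rate)
            for pair, values in curves.items()
        }
        total = sum(fit[2] for fit in fits.values())
        if best is None or total < best[1]:
            best = (rate, total, fits)
    return best
```

Because every curve here has the same number of distance bins, minimising the summed squared error and the summed mean squared error select the same rate. A positive fitted amplitude for a pair corresponds to the case discussed in the text where both groups tend to represent the same admixing source.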


The values of $\delta_{MN}$ carry information about which European group best represents 
each admixing source group (if any): for example, positive values of $\delta_{MN}$ 
suggest that groups M and N often carry haplotypes representing the same true, 
unsampled admixing source, while negative values of $\delta_{MN}$ suggest that M and N 
represent different admixing sources. We use principal components and linear 
modelling to jointly analyse all $\delta_{MN}$, both describing the haplotypes carried by each 
admixing source group as a linear combination of those carried by each of the 51 
European groups, and inferring the proportion of admixture contributed from each 
source (see ref. 22 for details). The inferred mixing coefficients from this linear 
modelling, along with the inferred admixture proportions for each source, allow a 
new estimate of the ancestry proportions describing the given UK cluster (that is, 
analogous to those described in ‘Estimating ancestry profiles’). We can therefore 
re-scale our coancestry vectors using these new ancestry proportions, giving new 
values of $\Psi_{MN}(d)$, from which we can re-infer the date(s) of admixture, offering 
improved accuracy of estimation provided the fitting procedure results in improve- 
ments in characterizing the true source groups. When analysing each UK cluster, 
we repeated this iterative process of ancestry proportion and date inference five 
times. Once these five iterations were completed, we then fixed the inferred ances- 
try proportions, and within each UK cluster performed 100 bootstrap re-samples 
of individuals’ chromosomes to infer 95% confidence intervals for the actual admix- 
ture date. 
Estimating the proportion of Saxon ancestry in central and southern England. 
It is of interest to estimate the proportion of Saxon ancestry in our Cent./S England 
cluster. We have undertaken two separate analyses which bear on this, namely our 
estimated ancestry profiles and the GLOBETROTTER analysis. One challenge is 
that various distinct modern European groups may carry DNA which descends 
from the Saxons (or their ancestors), and hence be informative about the contribu- 
tion of Saxon DNA to the UK. 

The pattern of contributions to UK clusters from GER3, and its location in 
Europe in northern Germany, make it very likely to capture ancestry brought to 
the UK by Saxon migrants (see main text Discussion). As noted in the discussion 
in the Supplementary Note, some of the ancestry shared with the group DEN18 
from modern Denmark could also reflect ancestry brought to the UK by the Saxon 
migrants. Ancestry shared with DEN18 could also have reached the UK in early 
migrations by land or sea, or in later migrations of the Danish Vikings. The fact that 
this group contributes some ancestry to all UK clusters is evidence that some of this 
ancestry sharing may indeed result from early migrations. The increased contri- 
bution of this group to the ancestry profiles of all the English clusters further sug- 
gests that some part also came to the UK with the Saxons. 

The contribution to the ancestry of the UK clusters from FRA17, now spread 
throughout France, is also correlated with the contribution of GER3 and DEN18. 
One possible explanation for this pattern is that FRA17 also captures Saxon ancestry. 
Another explanation is that it represents ancestry that spread into the UK at a 
different time, but into many of the same parts of the UK as the DNA from the later 
Saxon migrations. The Saxon migrations did not directly involve people from what 
is now France. There were movements of Germanic peoples, notably the Franks, 
into France around the time of the Saxon migration into England. The Germanic 
ancestry these migrations brought to what is now France would have been Frankish 
rather than Saxon, and it would have been diluted through mixing with the already 
substantial local populations. It thus seems unlikely that ancestry in the UK arising 
from the Saxon migrations would be better captured by FRA17 than by people now 
living near the homeland of the Saxons (represented by GER3)—the contribution 
of FRA17 is about threefold that of GER3. Further, the geographic pattern of 
FRA17 contributions differs from that of GER3 (which we see as very likely Saxon), 
in being relatively much higher in the Scottish and Orkney clusters. This is difficult 
to reconcile with ancestry from both groups arriving as part of the same migration 
event, and the substantial contribution of FRA17 in Scotland and Orkney, relative 
to GER3, is more likely to reflect an earlier influx into the UK, and increased time to 
spread geographically. Also, FRA17 did not figure as one of the source populations 
for the admixture event in Cent./S England estimated by the GLOBETROTTER 
analysis. We thus conclude that the contribution to the UK clusters from FRA17 is 
unlikely to reflect the Saxon migrations. 

In the ancestry profile approach, we thus argue that the proportion of DNA in 
modern Cent./S England inherited from the Saxons is best captured by GER3 and 
some of DEN18, which would suggest a range of ~10% (assuming only GER3 
reflected the Saxon migrations) to ~20% (assuming GER3 and all of DEN18 reflected 
the Saxon migrations). If we were wrong in concluding that the FRA17 contribu- 
tion does not result from DNA which arrived with the Saxon migrations, so that 
some or all of it did reflect Saxon DNA, then the proportion of Saxon ancestry 
could be substantially higher (up to ~50%). 

The GLOBETROTTER analysis of Cent./S England detected an admixture event, 
with a contribution of ~35% of DNA from GER3, with estimated dates for admix- 
ture somewhat after, but consistent with (see Discussion above), the known his- 
torical dates of the Saxon migrations (Extended Data Fig. 9). 

There are inevitable uncertainties in both analyses due to the nature of the data: 
we are trying to estimate admixture proportions for events ~1,200-1,500 years 
ago on the basis of DNA from modern populations. Nonetheless we feel it is safe 
to conclude from our analyses that the proportion of Saxon ancestry in Cent./S 
England is very likely to be under 50%, and most likely in the range 10-40%. 
Treatment of Eire. We explicitly excluded samples from Eire (the Republic of 
Ireland) from our European analyses, and as possible contributors to the ancestry 
profiles of the UK clusters, principally to allow assessment of the major migrations 
from continental Europe into the UK. Detailed early analyses, which included sam- 
ples from Eire with the other European samples, provided evidence of shared Irish 
ancestry with our UK samples, presumably reflecting in part migrations from 
Great Britain into Eire and vice versa. Eire thus acts as a source and a sink for ancestry 
from the UK, which severely complicates interpretation of estimated ancestry pro- 
files, since sharing of ancestry with Eire could reflect British migration into Eire 
rather than the converse. Also, the UK and Eire could share ancestry because both 
descend from some similar ancestral populations. 

While there is historical evidence of migration after the collapse of Roman rule 
from Devon and Cornwall into what is now Brittany in north-west France, this 
does not leave a signal in, and hence does not confound, our ancestry analyses, 
either because we do not have appropriate samples from Brittany or because the 
amount of DNA transferred from Britain to France via this route is relatively small. 
(Had this been an effect we would have expected to see either or both of our Devon 
and Cornwall clusters sharing substantially more ancestry from one of the groups 
in France, but this was not the case.) 
Maps and visualization. For the UK map boundaries we used a map of the UK 
sourced from the Office for National Statistics (England and Wales); National 
Records of Scotland; and the Northern Ireland Statistics and Research Agency. The 
European maps were sourced from Eurostat. For context we added the boundaries 
of the Republic of Ireland and the Isle of Man to the UK maps, taken from the 
European maps. Map boundaries were obtained in digitised form and were 
drawn using various packages in the statistical software language R. 

The latitude and longitude for each UK sample’s grandparents’ birthplaces were 
assigned (geocoded) automatically using a place name gazetteer from Edina 
(http://www.edina.ac.uk). All locations were checked for consistency between project 
records and the automatic geocoding, and any discrepancy resolved in favour of 
the project records. For the UK cluster analyses shown in Fig. 1, each sample was 
assigned, and plotted at, the average of the latitudes and longitudes of its 
grandparents’ birthplaces. For clarity of display a small, random amount of noise was 
added to each point’s latitude and longitude to avoid over-plotting. Independently across 
points, a random value was drawn from a uniform distribution on (−20a, 20a), 
where a is the smallest non-zero difference in latitude observed between the 
locations of any pair of points, and the resulting value was then added to the latitude of 
the point. An analogous procedure was then performed, independently, for the 
longitude of each point. 
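
The jittering rule can be sketched for one coordinate as follows; the same function would be applied separately to latitudes and to longitudes. The function names are illustrative, and the sketch assumes at least two distinct coordinate values are present.

```python
import random


def smallest_nonzero_gap(values):
    """Smallest non-zero difference between any two of the values."""
    unique = sorted(set(values))
    return min(b - a for a, b in zip(unique, unique[1:]))


def jitter(values, rng, multiple=20):
    """Add independent Uniform(-multiple * a, multiple * a) noise to each value,
    where a is the smallest non-zero gap observed among the values."""
    a = smallest_nonzero_gap(values)
    return [v + rng.uniform(-multiple * a, multiple * a) for v in values]


latitudes = [51.000, 51.001, 51.001, 51.250]
jittered = jitter(latitudes, random.Random(0))
```

Tying the noise width to the smallest observed gap keeps the displacement small relative to the map while still separating points that share a location.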

For the tree depicting the order of the hierarchical merging of clusters in Fig. 1, 
the lengths of the branches relate to changes in the posterior of the fineSTRUCTURE 
model. They do not relate directly to time or other measures of genetic distance so 
caution is needed in their interpretation. Some additional length was added to the 
tips of the tree for clarity. 

The ellipses displayed in Fig. 1, Extended Data Figs 1, 3 and 4 and Supplemen- 
tary Fig. 1 were obtained by fitting a two dimensional t-distribution with five de- 
grees of freedom to the plotted spatial locations associated with each cluster. Each 
ellipse depicts the 90% probability region of the fitted distribution. 
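
For a bivariate t-distribution with ν degrees of freedom, the squared Mahalanobis distance Q from the centre satisfies P(Q ≤ q) = 1 − (1 + q/ν)^(−ν/2), so the boundary of a probability region has a closed form. The sketch below uses this to turn a fitted 2 × 2 scale matrix into ellipse semi-axes. It does not show the fitting step itself, and the function names and the example scale matrix are hypothetical.

```python
import math


def t_region_scale(prob=0.9, dof=5):
    """Squared Mahalanobis radius enclosing `prob` of a bivariate t-distribution."""
    return dof * ((1.0 - prob) ** (-2.0 / dof) - 1.0)


def ellipse_axes(scale_matrix, prob=0.9, dof=5):
    """Semi-axis lengths and major-axis angle (radians) of the probability ellipse.

    scale_matrix: symmetric 2 x 2 scale matrix [[a, b], [b, c]] of the fitted t.
    """
    (a, b), (_, c) = scale_matrix
    half_trace = (a + c) / 2.0
    root = math.hypot((a - c) / 2.0, b)
    eig_major, eig_minor = half_trace + root, half_trace - root
    q = t_region_scale(prob, dof)
    angle = 0.5 * math.atan2(2.0 * b, a - c)
    return math.sqrt(q * eig_major), math.sqrt(q * eig_minor), angle


major, minor, angle = ellipse_axes([[4.0, 0.0], [0.0, 1.0]])
```

With five degrees of freedom and a 90% region, the squared radius is about 7.56, noticeably larger than the value of about 4.61 that a bivariate normal would give, reflecting the heavier tails of the t-distribution.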

Only limited geographic information was known about the European samples: 
often this was just the city or region from which the samples were taken, but some- 
times only a country was known. To visualize the spatial patterns of the European 
genetic groups obtained from the fineSTRUCTURE analyses we plotted the Euro- 
pean samples on a map of Europe, with colours reflecting the groups assigned by 
fineSTRUCTURE. We did this in two ways, plotting individual points for Fig. 2 
(depicting the ancestry profiles) and using pie charts in Extended Data Fig. 5b. 

For Fig. 2 we restricted ourselves to plotting only those European samples that 
have some fine-scale location information (that is, more precise than just country 
of sampling), as these samples will be informative for assessing regional fine struc- 
ture (although all samples in the group are used for generating the ancestry profile). 
As all the samples from a given region/city have exactly the same location assigned 
to them, we added some random noise to each sample’s assigned latitude and longitude 
to enable visualization on a map. To do this for each sample we drew two 
samples from a uniform distribution on (−0.5, 0.5), in units of degrees of latitude and 
longitude, and added the results to the sample’s latitude and longitude respectively. 
We plot each sample as a point on the map, coloured to indicate the European group 
to which they are assigned. Figure 2 shows the locations of the samples assigned to 
each European group that contribute at least 2.5% to at least one of the UK clus- 
ters. As several inferred European groups are represented in the French sampling 
locations, and would thus be difficult to discern, the points for groups FRA12, 
FRA14 and FRA17 have been shifted by one degree of both latitude and longitude 
(for FRA12, −1 degree of longitude and −1 degree of latitude; FRA14, +1 and 
+1; FRA17, −1 and +1). In Fig. 2 the lines to each group (or set of groups) end at 
the centre of mass of the groups. This was calculated before any samples had their 
locations shifted (as for the French groups, and/or by adding random noise). For 
the Norwegian groups and the Swedish groups the line ends at the average position 
of the centres of mass of the constituent groups. For the groups GER3 and GER6 
the centre of mass is calculated using only those samples from Germany. This is 
because several samples from these groups are assigned to Stockholm, Copenhagen 
and Oslo, all of which are major cities. We assume these samples are migrants from 
Germany, and thus including them would skew the centre of mass position that we 
interpret as the approximate historical locus for the group. This potential problem 
caused less of an issue for the other groups depicted in Fig. 2. 

For Extended Data Fig. 5b, the spatial patterns of the European genetic groups 
obtained from the fineSTRUCTURE analyses are displayed in pie charts. All of the 
samples from the same location are displayed together in a single pie chart, with the 
sectors of the pie chart coloured to reflect the proportion of samples from that loca- 
tion that are assigned by fineSTRUCTURE to a given group. The pie charts are 
centred at suitable locations on the map of Europe, depending on the geographic 
information known (see relevant figure captions). The size of each pie chart indicates 
the number of samples it represents: the area of the chart is proportional to that 
number. For the larger sampling 
locations, if a European group accounts for at least 20% of a location’s samples then 
the European group number is also displayed on the edge of the appropriate sector 
of the associated pie chart. 
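
Making the area, rather than the radius, proportional to the sample count means the radius grows with the square root of the count. A minimal sketch, in which `area_per_sample` is a hypothetical display constant:

```python
import math


def pie_radius(n_samples, area_per_sample=1.0):
    """Radius of a pie chart whose area is proportional to the sample count."""
    return math.sqrt(n_samples * area_per_sample / math.pi)


ratio = pie_radius(400) / pie_radius(100)
```

Here a location with four times as many samples is drawn with twice the radius.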

For the ancestry profile analyses we also display pie charts, this time on a map of 
the UK (Extended Data Fig. 6). Here each pie chart relates to one of the inferred UK 
clusters and is displayed at the centre of the cluster’s associated ellipse (as described 
above). Each sector of the pie chart is coloured (and sometimes numbered) by the 
relevant colour (and number) of the European group it relates to. The subtended 
angle of each sector represents the proportion of the UK cluster’s ancestry that is 
most similar to that of samples from the relevant European group as described in 
the estimating ancestry profiles section above. 

Consent and study protocol. Informed consent was obtained from all subjects. For 
the UK subjects, ethics approval was granted by the NRES Committee, Yorkshire 
and the Humber — Leeds West, UK (Reference 05/Q1205/35) in March 2005. For 
the European subjects, informed consent and ethics approval were obtained as part 
of the WTCCC2 multiple sclerosis study. 

Code availability. The software CHROMOPAINTER, fineSTRUCTURE and 
GLOBETROTTER are available for download at http://www.paintmychromosomes.com/. 

Extended Data Figure 1 | The effect of setting a threshold on the confidence 
of cluster assignment for the genetic clusters in the UK inferred by the 
fineSTRUCTURE analysis. The UK map depicts the clustering of the 2,039 
UK individuals into 17 clusters on the basis of genetics alone. See Fig. 1 for 
further details. Here a threshold is set on the measurement of confidence used 
for assigning individuals to clusters (see Methods). This measure is defined 
on the interval [0, 1], where the value 1 is interpreted as meaning complete 
certainty of cluster assignment and 0 as being complete lack of certainty. The 
plot illustrates the effect of setting a threshold of 0.7 so that a UK individual is 
only assigned to a cluster if the measure of assignment for that individual is 
greater than 0.7. All of the samples that have small, faded symbols are assigned 
to their clusters with confidence greater than 0.7. Those samples for which 
the assignment is less confident (that is, the measure is less than or equal to 0.7) 
are plotted with large, bold symbols. The table shows the number of individuals 
with confidence measure above and below the 0.7 threshold together with 
the total for each UK cluster. The slight discrepancy between the totals in this 
figure and Supplementary Information Fig. 1.16 is due to differences in the 
method for assigning individuals to clusters (see Methods). The threshold of 0.7 
was chosen for illustrative purposes only. Similar patterns relate to other 
thresholds. Contains OS data © Crown copyright and database right 2012. 
© EuroGeographics for some administrative boundaries. 

Confidence > 0.7   Confidence ≤ 0.7   Total 
92                 7                  99 
73                 8                  81 
81                 0                  81 
75                 0                  75 
63                 11                 74 
53                 15                 68 
51                 0                  51 
39                 1                  40 
35                 1                  36 
32                 2                  34 
28                 0                  28 
21                 2                  23 
11                 5                  16 
14                 0                  14 
1860               179                2039 

Extended Data Figure 2 | Convergence of the algorithm implemented in 
fineSTRUCTURE. The fineSTRUCTURE clustering algorithm was run twice 
on the UK samples (a) and twice on the European samples (b) to assess 
convergence. The displayed heatmap depicts the proportion of sampled 
MCMC iterations for which each pair of UK individuals is assigned to the same 
cluster. The values above and below the diagonal represent two different runs of 
fineSTRUCTURE. Individuals are ordered along each axis according to the 
inferred tree from the fineSTRUCTURE run above the diagonal, with tick 
marks on the axes at the middle of each cluster. Comparison between runs is 
made by comparing the plot above the diagonal (run two) with that below the 
diagonal (run one). The high degree of symmetry in the plot confirms the 
similarity between the runs and hence that each MCMC run has converged to 
very similar clusters. 


©2015 Macmillan Publishers Limited. All rights reserved 


[Figure panels: scatter plots of the UK samples on pairs of principal component axes (axes labelled Principal Component 1, Principal Component 2 and Principal Component 3).] 

Extended Data Figure 3 | Application of standard methods for detecting 
population structure to the UK data. a, Genome-wide principal component 
analysis of the UK samples. The UK samples plotted against all pairs of 
principal component axes, for the first five axes, as determined in the 
genome-wide principal components analysis. Each individual is depicted by a 
symbol representing the district from which it was collected. The labels of the 
sample collection districts are interpreted as follows: CUM, Cumbria; LIN, 
Lincolnshire; NEA, north east England; OXF, Oxfordshire; YOR, Yorkshire; 
CHE, Cheshire; NTH, Northamptonshire; NOT, Nottinghamshire; DOR, 
Dorset; SUS, Sussex; NOR, Norfolk; WOR, Worcestershire; DEV, Devon; SPE, 
south Pembrokeshire; COR, Cornwall; NWA, north Wales; ARG, Argyll and
Bute; NPE, North Pembrokeshire; BAN, Banff and Buchan; NIR, Northern 
Ireland; ORK, Orkney; SUF, Suffolk; LEI, Leicestershire; FOD, Forest of Dean; 
HER, Herefordshire; HAM, Hampshire; DER, Derbyshire; LAN, Lancashire;
KEN, Kent; GLO, Gloucestershire. b, Clustering the UK samples using the
program ADMIXTURE. ADMIXTURE was applied in three scenarios, 
corresponding to different preset values for K, the number of clusters into 
which the UK samples are divided. Here K = 2, 3 and 17 (see Methods). A map 
is shown for each value of K. Each symbol on the map corresponds to one of the 
sampled individuals and is plotted at the centroid of their grandparents’ 
birthplace. Each cluster is represented by a unique combination of colour and 
plotting symbol, with individuals depicted with the symbol of the cluster to 
which they are assigned. The ellipses centred on each cluster give a sense of 
the extent of the cluster by showing the 90% probability region of the 
two-dimensional t-distribution (5 degrees of freedom) which best fits the 
locations of the individuals in the cluster. Contains OS data © Crown 
copyright and database right 2012. © EuroGeographics for some 
administrative boundaries. 

Extended Data Figure 4 | Potential recent shared ancestry in the genetic
clusters in the UK inferred by the fineSTRUCTURE analysis. a, The UK map 
on the left depicts the clustering of the 2,039 UK individuals into 17 clusters on 
the basis of genetics alone. See Fig. 1 for further details. Pairwise identity by 
descent (IBD) within clusters and across the whole UK sample for all of the 
2,039 UK individuals is shown to the right. For each of the inferred UK clusters 
a box and whisker plot shows the distribution of the pairwise IBD statistic 
(see Methods). Each box is filled by the colour of the cluster to which it relates, 
and the outlier points have the same shape as the cluster to which they relate. 
For comparison the distribution of the pairwise IBD statistic across the whole
UK sample is shown on the far right, with the box coloured grey. The light grey
horizontal lines indicate the upper and lower quartiles of the IBD statistic’s
distribution for the whole UK sample. Along the x axis the number of
samples in the associated cluster is shown. The y axis gives the value of
the pairwise IBD statistic. b, The same information as a but with 53
clusters of UK individuals. Note that only clusters of size 4 or less depart
substantially from the average relatedness. Contains OS data © Crown
copyright and database right 2012. © EuroGeographics for some
administrative boundaries.

Extended Data Figure 5 | Population structure in the European samples.
a, Number of samples derived from each European sampling region. The 6,209 
European samples used for the analyses were sampled from ten countries and 
various locations within each country. Each sample has a specific sampling 
location (often a city, but in some cases only a whole country). The numbers 
shown give the number of samples derived from a particular location.
Some numbers are depicted out of position for clarity. In these cases a line leads
from the number to the actual location. Where the sample locations
are well-localized (for example, the city of sampling is known) the box
surrounding the number is white. When only information about the country of 
sampling is known the box is coloured yellow. The numbers are overlain on a 
faded version of the pie charts from panel b for easy reference. b, European 
population structure inferred by fineSTRUCTURE. The 6,209 European
samples were divided into 51 genetic groups (represented by colours and labelled
with a subset of the numbers between 1 and 145) using fineSTRUCTURE. For
clarity the colour space has been skewed to emphasize the differences
between groups 1 to 18 as these groups are the major contributors to the 
ancestry profiles of the UK clusters. Each sample has a specific sampling 
location (often a city, but in some cases only a country, see panel a). The pie 
charts are located at these sampling locations, and depict the proportion of the 
samples from that location assigned to each of the 51 genetic groups.
Each genetic group also has a label number, which is displayed for the larger
sectors of each of the pie charts. The area of the pie chart is proportional to
the number of samples from that location. Pie charts with black borders
correspond to well-localized samples. In contrast, samples for which only
the country of sampling is known are combined in a single pie chart for
the country, which is shown with white borders. Some pie charts are
depicted out of position for clarity; in these cases a line leads from
the chart to the actual location. © EuroGeographics for the
administrative boundaries.

Extended Data Figure 6 | European ancestry profiles of the UK clusters.
a, The map of the UK shown relates to the map with 17 UK clusters shown
in Fig. 1. Ellipses indicate the extent of the UK clusters as in Fig. 1. The pie charts
represent the ancestry profile of the UK clusters from Fig. 1. Each pie chart
is plotted at the centroid of the corresponding cluster, although some pie charts
have been moved for clarity; in the cases where the relocation is substantial a red 
line leads from the pie chart to the centroid. The sectors of the pie charts are 
coloured with the colours of the European genetic groups (for the larger sectors 
the number of the European group is also given). They indicate the ancestry 
profiles of each UK cluster, namely the proportion of the cluster ancestry that is 
best represented by each of the European groups. The magnitude of the angle of 
a sector is proportional to the contribution of that European group to the 
ancestry profile of the associated UK cluster. The symbols in the grey bar to the 
left of the map represent the UK clusters as in Fig. 1. The bar chart in the left 
part of the plot depicts the same ancestry profiles of the UK clusters in a 
different way. Each row represents a UK cluster (arranged roughly north to 
south) with the symbols for the clusters from Fig. 1 indicated at each end of the 
row. Each column represents a European group, with group numbers listed
with a three letter prefix that, for clarity, relates to the country or countries 
where the cluster is most represented. The colour of each bar also indicates the 
European group to which the bar relates. Confidence intervals (95%) obtained 
from 1,000 bootstraps of the ancestry profile analysis (see Methods) are 
indicated on each bar. b, Renormalized ancestry profiles of the UK clusters 
illustrating possible early European contributions to the UK population. A 
representation of the relative contributions to the UK clusters from the three 
European groups (GER6-W. Germany, BEL11-Belgium, and FRA14-NW
France) hypothesized to be the major contributors to the earliest migrations 
into the UK after the last ice age from which DNA survives to the present in 
substantial proportions (see Supplementary Note). Interpretation of the
map, pie charts and bar chart is as for a. In this case, however, the proportions
were renormalized to sum to 1 for the contributions from GER6, BEL11
and FRA14. Contains OS data © Crown copyright and database right 2012.
© EuroGeographics for some administrative boundaries. 




Extended Data Figure 7 | More major events in the peopling of the British Isles. See Supplementary Note for further details. a, The arrival of agriculture and
subsequent migrations from 4000-2500 BC. b, The major Iron Age tribes of Britain around AD 40. © EuroGeographics for the administrative boundaries.

Extended Data Figure 8 | Application of GLOBETROTTER to infer
simulation of ancestry 40 generations ago between groups from Northern 
Germany (GER3, 25%) and Italy (ITA36, 75%). Twenty-five admixed 
individuals were simulated, and the individuals used to construct these 
simulated individuals were then removed from the list of potential donors 
(see Methods). Left barplot, and map: the barplot shows the true population 
and proportion contributed for each of the two admixing groups. The map 
shows, for each of the European sampling locations, the true proportion of 
individuals sampled from that location assigned to each of the admixing 
groups, coloured according to the barplot. Central three plots: example curves 
constructed by GLOBETROTTER to infer admixture times, and infer details of 
admixing groups (see Methods and Supplementary Note). For each pair of 
populations A and B (A can be the same as or different from B) the points show 
the empirical probability, relative to under independence, as a function of 
genetic distance x, that two positions separated by distance x correspond to 
ancestry donated by population A, and by population B, respectively. The green 
line shows GLOBETROTTER fitted exponential decay curves for the 
underlying (that is, expected) value of this relative probability estimate. Under 
a model of a single admixture event occurring g generations ago, this 
probability decays at a rate g according to theory, providing an estimate of 
the admixture time (and 95% CI) shown overlaying the curve ITA36 versus 
GER3. If ancestries A and B associate with the same admixing group, for
example, whenever A = B, the fitted curve will have a negative slope, as seen for
the GER3 versus GER3 plot. If a positive slope is seen, as for the ITA36 versus
GER3 plot, this implies these populations contribute to the two different
respective admixing groups. Right bar-plot, and map: GLOBETROTTER 
produces an inference of the genetic composition of (haplotypes carried by) the 
two admixing groups, as a mixture of (haplotypes carried by) populations 
actually sampled. This mixture inference jointly uses curves for pairs of 
sampled populations, and the overall haplotypic makeup of different sampled 
populations, including the admixed group. The bar-plot shows the inferred 
mixture representation (dominated in each case by the true admixing groups) 
and estimated admixture proportion (24%, close to the truth of 25%), with 
more red/blue populations respectively giving a larger contribution. The map 
shows populations inferred as contributing to the first (pink/red shades) or 
second (blue shades) admixing group, respectively, as for the left map, with 
populations coloured according to the bar-plot. This shows populations falsely 
inferred as contributing material to the admixing groups were still sampled, 
mainly, from locations close to those of the true admixing groups. We 
caution that in this setting of admixture between genetically similar 
European groups, estimation of admixture fraction is very uncertain (see 
Methods) (for example, contributing populations are often impossible to 
definitively assign to a ‘side’ of the event). For further details of the analysis, 
for example, tests for admixture presence in this simulation, see
Methods and Supplementary Note. © EuroGeographics for the
administrative boundaries.
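
The relationship between the decay rate of these curves and the admixture time can be illustrated with a small fit. The sketch below is not GLOBETROTTER's fitting procedure: it assumes a single-event curve of the form y(x) = 1 + A·exp(−g·x), with genetic distance x in Morgans and a baseline of 1 under independence, and it uses an ordinary log-linear least-squares fit on noise-free synthetic points. The function name and the synthetic amplitude are hypothetical.

```python
import math


def fit_admixture_time(distances_morgans, relative_prob):
    """Estimate g (generations) and amplitude A from y = 1 + A * exp(-g * x).

    Assumes a single admixture event, distances in Morgans, and deviations
    from 1 that all share one sign (illustrative simplifications).
    """
    points = [(x, y - 1.0) for x, y in zip(distances_morgans, relative_prob)
              if abs(y - 1.0) > 1e-12]
    if len(points) < 2:
        raise ValueError("need at least two points that deviate from 1")
    sign = 1.0 if points[0][1] > 0 else -1.0
    if any(dev * sign <= 0 for _, dev in points):
        raise ValueError("deviations from 1 change sign; model does not apply")

    xs = [x for x, _ in points]
    zs = [math.log(sign * dev) for _, dev in points]  # ln|y - 1| = ln|A| - g x
    n = len(xs)
    mean_x = sum(xs) / n
    mean_z = sum(zs) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sum((x - mean_x) * (z - mean_z) for x, z in zip(xs, zs)) / sxx
    intercept = mean_z - slope * mean_x
    return -slope, sign * math.exp(intercept)


# Synthetic curve: 40 generations, negative amplitude (curve rises towards 1,
# the "positive slope" case described for populations on opposite sides).
true_g, true_amp = 40.0, -0.3
xs = [0.01 * k for k in range(1, 21)]
ys = [1.0 + true_amp * math.exp(-true_g * x) for x in xs]
g_hat, amp_hat = fit_admixture_time(xs, ys)
```

With real data the points are noisy and the log transform becomes unreliable far from the origin, so a weighted nonlinear fit would be the more natural choice; the sketch only shows why the decay rate maps directly onto generations since admixture.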

Extended Data Figure 9 | Application of GLOBETROTTER to infer details
of admixture in the UK clusters. a, Inferring admixture in a population of
1,044 UK individuals from central and southern England. Left hand plot: the
bold red squares show mean grandparental birthplace for each individual
in this cluster. Central three plots: example curves constructed by
GLOBETROTTER to infer admixture times, and infer details of admixing 
groups (see Methods and Supplementary Note). For each pair of populations 
A and B (A can be the same as or different from B) the points show the 
empirical probability, relative to under independence, as a function of genetic 
distance x, that two positions separated by distance x correspond to ancestry 
donated by population A, and by population B, respectively. The green line 
shows GLOBETROTTER fitted exponential decay curves for the underlying 
(that is, expected) value of this relative probability estimate. Under a model 
of a single admixture event occurring g generations ago, this probability decays 
at a rate g according to theory, providing an estimate of the admixture time 
(and 95% CI) shown overlaying curves SFS31 versus GER3 and SFS31 versus 
SFS31. If ancestries A and B associate with the same admixing group, for 
example, whenever A = B the fitted curve will have negative slope, as seen for 
the GER3 versus GER3 plot. If a positive slope is seen, as for the SFS31 versus 
GER3 plot, this implies these populations contribute to the two different 
respective admixing groups. Right bar-plot, and map: GLOBETROTTER 
inference shows one possibility for the genetic composition of (haplotypes 
carried by) the two unsampled historical admixing groups, as a mixture of 
(haplotypes carried by) populations actually sampled. This mixture inference 
jointly uses curves for pairs of sampled populations, and the overall haplotypic 
makeup of different sampled populations, including the admixed group. The 
bar-plot shows the inferred mixture representation (with largest contributions 
in each case by GER3/DEN18, sampled most frequently from northern 
Germany and Denmark, and SFS31/ITA52, sampled mainly from southern 
France and Spain and northern Italy) and estimated admixture proportion 
(34%), with more intense red/blue populations respectively implying a larger 
contribution. The map shows populations inferred as contributing to the first 
(pink/red shades) or second (blue shades) admixing group respectively, with 
populations coloured according to the bar-plot. We caution that in this setting 
of admixture between genetically similar European groups, estimation of 
admixture fraction is very uncertain (see Methods and Supplementary Note) 
(for example, contributing populations are often impossible to definitively 
assign to a side of the event), so that other closely related scenarios, for example, 
a somewhat lower admixture fraction from a more completely ‘GER3’-like
group than that inferred, are likely consistent with the GLOBETROTTER 
results seen. b, Inferring admixture in a population of 51 UK individuals from 
Orkney. Left hand plot: the bold purple squares show mean grandparental
birthplace for each individual in this cluster. Central three plots: example curves
constructed by GLOBETROTTER to infer admixture times, and infer details of 
admixing groups (see Methods and Supplementary Note). For each pair of 
populations A and B (A can be the same as or different from B) the points show 
the empirical probability, relative to under independence, as a function of 
genetic distance x, that two positions separated by distance x correspond to 
ancestry donated by population A, and by population B, respectively. The green 
line shows GLOBETROTTER fitted exponential decay curves for the 
underlying (that is, expected) value of this relative probability estimate. 
Under a model of a single admixture event occurring g generations ago, this 
probability decays at a rate g according to theory, providing an estimate of the 
admixture time (and 95% CI) shown overlaying curves NOR90 versus FRA12 
and NOR90 versus NOR90. If ancestries A and B associate with the same 
admixing group, for example, whenever A = B the fitted curve will have 
negative slope, as seen for the NOR90 versus NOR90 plot. If a positive slope is 
seen, as for the NOR90 versus FRA12 plot, this implies these populations 
contribute to the two different respective admixing groups. Right bar-plot, and 
map: GLOBETROTTER inference shows one possibility for the genetic 
composition of (haplotypes carried by) the two unsampled historical admixing 
groups, as a mixture of (haplotypes carried by) populations actually sampled. 
This mixture inference jointly uses curves for pairs of sampled populations, 
and the overall haplotypic makeup of different sampled populations, including 
the admixed group. The bar-plot shows the inferred mixture representation 
(with largest contribution in each case by GER3/NOR90, sampled most 
frequently from northern Germany and Norway, and FRA12/FRA14, both 
sampled mainly from France) and estimated admixture proportion (42%), with 
more intense red/blue populations respectively implying a larger contribution. 
The map shows populations inferred as contributing to the first (pink/red 
shades) or second (blue shades) admixing group respectively, with populations 
coloured according to the bar-plot. We caution that in this setting of admixture 
between genetically similar European groups, estimation of admixture 
fraction is very uncertain (see Methods and Supplementary Note) (for example, 
contributing populations are often impossible to definitively assign to a ‘side’ 
of the event). In particular, inspection of curves involving GER3 does not 
yield a clear ‘side’ of the event for this population, unlike the NOR90 versus 
FRA12 case that implies French-like and Norwegian-like haplotype presence 
must occur mainly in distinct admixing groups. Therefore the GER3 
component might in fact capture haplotypes for either (or both) the French-like 
or Norwegian-like admixing groups, and the inferred scenario shows only 
one possibility. Contains OS data © Crown copyright and database right 2012. 
© EuroGeographics for some administrative boundaries. 

Visualizing transient Watson-Crick-like mispairs in DNA and RNA duplexes

Isaac J. Kimsey, Katja Petzold, Bharathwaj Sathyamoorthy, Zachary W. Stein & Hashim M. Al-Hashimi


Rare tautomeric and anionic nucleobases are believed to have fundamental biological roles, but their prevalence and
functional importance have remained elusive because they exist transiently, in low abundance, and involve subtle
movements of protons that are difficult to visualize. Using NMR relaxation dispersion, we show here that wobble dG•dT and
rG•rU mispairs in DNA and RNA duplexes exist in dynamic equilibrium with short-lived, low-populated
Watson-Crick-like mispairs that are stabilized by rare enolic or anionic bases. These mispairs can evade Watson-Crick fidelity
checkpoints and form with probabilities (10⁻³ to 10⁻⁵) that strongly imply a universal role in replication and translation
errors. Our results indicate that rare tautomeric and anionic bases are widespread in nucleic acids, expanding their
structural and functional complexity beyond that attainable with canonical bases.


Nucleic acid bases exist predominantly in one neutral tautomeric form.
This in turn gives rise to the strict Watson-Crick (WC) pairing rules
(Fig. 1a) that govern how genetic information is replicated, transcribed,
and translated. However, if bases adopt alternative energetically disfavoured
tautomeric or anionic forms (Fig. 1a), pairing rules can be
violated and new functions can emerge. For example, although rarely observed,
minor tautomeric and anionic bases can form WC-like dG•dT,
dA•dC, and rG•rU mispairs that are believed to contribute to spontaneous
mutations and translational errors. Chemical modifications
that stabilize or lock bases in their anionic or enol-like forms can
be mutagenic or expand the decoding capacity of transfer RNAs.
In addition, anionic and tautomeric forms of the bases are believed to
play crucial roles in nucleic acid catalysis, RNA-ligand recognition,
and in the therapeutic mechanisms of nucleic acid base analogues.


Despite growing evidence that rare tautomeric and anionic bases have
important roles in nucleic acids, their occurrence, stabilities, and biological
significance have remained elusive. Characterizing rare tautomeric
and anionic bases in polynucleotides is a longstanding problem
because such energetically unfavourable species typically exist in low
abundance, for short periods of time, and involve movements of protons
that are difficult to visualize at the atomic level. NMR relaxation dispersion
(RD) techniques enable characterization of low-populated
(populations of 0.1-10%) transient (lifetimes of micro- to milliseconds)
states of nucleic acids that are often referred to as ‘excited states’
(ES). Here, we use NMR RD to characterize transient WC-like dG•dT
and rG•rU mispairs in DNA and RNA that are stabilized by rare tautomeric
and anionic bases and obtain evidence that they have universal
roles in misincorporation during replication and translation.

Figure 1 | Chemical exchange in dG•dT mispairs. a, WC dG•dC, sterically
prohibited WC dG•dT, and WB dG•dT (R1ρ measured nuclei highlighted in
red ovals). Below are four WC-like tautomeric and anionic (implied charge
delocalization) bp. b, DNA duplex with a ¹³C/¹⁵N site-labelled dG•dT mispair.
c, RD profiles for dG•dT (25 °C and pH 6.9) showing R2 + Rex as a function of
the spin lock offset (Ωoff 2π⁻¹) and power (ωSL 2π⁻¹, in insets) with global fits
to dG-N1, dG-C8, dT-N3, and dT-C6. Error bars represent experimental
uncertainty (one s.d., see Methods). χ² = 1.59, R² = 0.99, and P < 0.0001.
d, RD profiles showing three-state exchange (25 °C and pH 8.4) and global
three-state fit to dG-N1 and dT-N3. Error bars represent experimental
uncertainty (one s.d.). χ² = 0.88, R² = 0.99, and P < 0.0001.

Transient WC-like dG•dT tautomer mispair

dG•dT mispairs generally adopt a distinct ‘wobble’ (WB) geometry
(Fig. 1a) since a WC geometry results in a steric clash between imino
protons (Fig. 1a). However, enol tautomers of dG or dT, or their anionic
form, can alleviate this steric clash and allow formation of WC-like
dG•dT mispairs (Fig. 1a). Soon after the discovery of the DNA double
helix, Watson and Crick hypothesized that such WC-like mispairs could
provide a basis for spontaneous mutations. We used NMR rotating
frame spin relaxation (R1ρ) RD to examine whether WB dG•dT
mispairs can transiently morph into such WC-like dG•dT mispairs in
canonical DNA duplexes. For these studies we used a hairpin DNA duplex
(hp-GT DNA) containing a site-specifically ¹³C/¹⁵N-labelled dG•dT
WB mispair (Fig. 1b and Extended Data Fig. 1). Exchange between WB
and WC dG•dT mispairs entails deprotonation of either dG-N1 or dT-N3
via tautomerization (neutral) or ionization (charged), both of which
would induce large changes in N1/3 chemical shifts (CSs) and therefore
give rise to substantial ¹⁵N RD. In contrast, because a WB-to-WC transition
preserves an anti base and C2′-endo sugar pucker, it is expected
to induce smaller changes in the sugar (dG-C1′ and dT-C1′) and base
(dG-C8 and dT-C6) carbon CSs and therefore induce more limited
¹³C RD. We observed very substantial ¹⁵N RD at base imino dG-N1
and to a lesser extent at dT-N3, much less substantial ¹³C RD at base
dG-C8 and dT-C6, and essentially no ¹³C RD at sugar dG-C1′ and dT-C1′
at pH 6.9 and 25 °C (Fig. 1c and Extended Data Figs 2, 3). This unique
pattern of RD is consistent with exchange directed towards a transient
WC-like mispair (Fig. 1a). It is inconsistent with exchange directed towards
other base pair (bp) geometries such as Hoogsteen or base opened
states (Supplementary Discussion 1 and Extended Data Fig. 4). A second
exchange process that was apparent at pH 8.4 (Fig. 1d) will be discussed
further below. Similar RD profiles were observed in a different
DNA duplex (Extended Data Figs 1, 2), indicating that the observed
dG•dT exchange occurs robustly in DNA duplexes.

Figure 2 | Characterizing WC-like transient states. a, Population and
lifetime of dG•dT ES1/ES2 measured in hp-GT DNA as a function of
temperature (at pH 6.9) and pH (at 25 °C). Error bars in fitted parameters
reflect experimental uncertainty (one s.d.). b, Differences between the GS
(referenced to 0 p.p.m.) and ES CSs (Δω = ωES − ωGS) for hp-GT DNA.
Error bars reflect experimental uncertainty (one s.d.). c, Measured CSs for ES1
are plotted against DFT-predicted values. R² = 0.99 and P = 0.003.

The RD data measured at dG-N1, dT-N3, dG-C8 and dT-C6 could
be globally fitted (Supplementary Table 1 and Extended Data Figs 2, 3)
to a single exchange process directed towards an excited state (ES1) that
has a population (pES1) of ~0.17% and a lifetime (τES1) of ~0.38 ms
(Fig. 2a). ES1 is characterized by ¹⁵N CSs that are considerably downfield
shifted for dG-N1 (ΔωN1 = +36 p.p.m.) and to a lesser extent dT-N3
(ΔωN3 = +18 p.p.m.) (Fig. 2b, Extended Data Fig. 4). The downfield-shifted
imino nitrogen CSs are unprecedented for nucleic acids, and are
directed towards the CSs of bases (dG and dT) that have been deprotonated
owing to ionization or modifications that lock an enol-like form
(Δω ≈ +50-60 p.p.m.) (Extended Data Fig. 5). On the other hand,
ES1 features much smaller changes in carbon CSs (Fig. 2b), consistent
with a WB-to-WC transition.

It would be highly energetically disfavoured to simultaneously deprotonate
dG-N1 and dT-N3 when forming ES1. Moreover, although the
magnitude of the ES1 N1/N3 downfield CSs strongly suggests deprotonation,
it is not as far downfield shifted as expected based on deprotonation
of nucleotides in free solution (Extended Data Fig. 5). Therefore,
the strongly but incompletely downfield-shifted dG-N1 and dT-N3 CSs
suggest that ES1 consists of at least two WC-like species in rapid exchange
on the NMR timescale in which either dG-N1 (dGᵉⁿᵒˡ•dT or
dG⁻•dT) or dT-N3 (dG•dTᵉⁿᵒˡ or dG•dT⁻) is deprotonated (Fig. 1a).
The ES1 population and CSs are largely independent of pH within the
pH range of 6.0-7.9 (Fig. 2a and Supplementary Discussion 2). This is
inconsistent with exchange directed towards ionic dG⁻•dT and dG•dT⁻
(Extended Data Fig. 5). Rather, the population of ES1 increases with
temperature (Fig. 2a) as expected for tautomeric species dGᵉⁿᵒˡ•dT and
dG•dTᵉⁿᵒˡ.

Based on the measured pES1 and τES1, the free energy difference (ΔG)
between GS and ES1 is 3.8 kcal mol⁻¹ and the forward free energy barrier
(ΔG‡) is 16.4 kcal mol⁻¹ (Extended Data Fig. 6). These values are
in good agreement with computationally predicted parameters (2.8-5.6
kcal mol⁻¹ (ref. 31) and ~17-21 kcal mol⁻¹ (refs 31, 32), respectively)
for dG•dT WB-to-WC tautomer transitions.
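
As a rough check on how a population and a lifetime translate into these energies, the following sketch applies the standard two-state Boltzmann relation for the free energy difference and an Eyring-type expression for the forward barrier. The transmission coefficient of 1, the strict two-state treatment and the function names are assumptions made for illustration; the exact expressions used for the reported values are those described in the Methods and Extended Data Fig. 6.

```python
import math

R_KCAL = 1.987204e-3        # gas constant, kcal mol^-1 K^-1
KB_OVER_H = 2.083661912e10  # Boltzmann constant / Planck constant, s^-1 K^-1


def free_energy_difference(p_es, temperature_k=298.15):
    """Delta G (kcal/mol) between ground and excited state, two-state model."""
    p_gs = 1.0 - p_es
    return -R_KCAL * temperature_k * math.log(p_es / p_gs)


def forward_barrier(p_es, lifetime_s, temperature_k=298.15, kappa=1.0):
    """Forward barrier (kcal/mol) from an Eyring-type expression.

    The excited-state lifetime gives the reverse rate; detailed balance gives
    the forward rate. kappa = 1 is an assumed transmission coefficient.
    """
    k_reverse = 1.0 / lifetime_s
    k_forward = k_reverse * p_es / (1.0 - p_es)
    prefactor = kappa * KB_OVER_H * temperature_k
    return R_KCAL * temperature_k * math.log(prefactor / k_forward)


# ES1 values quoted in the text: population ~0.17%, lifetime ~0.38 ms, 25 degrees C.
dg_es1 = free_energy_difference(0.0017)
barrier_es1 = forward_barrier(0.0017, 0.38e-3)
```

Under these assumptions the free energy difference comes out at about 3.8 kcal mol⁻¹ and the barrier falls between 16 and 17 kcal mol⁻¹, in line with the values quoted above; small differences are expected from rounding of the quoted population and lifetime.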

These computational studies also predict that dGᵉⁿᵒˡ•dT probably
exists in fast exchange on the NMR timescale (free energy barrier ~5-6
kcal mol⁻¹, ref. 31) with a minor (20%) dG•dTᵉⁿᵒˡ species (Fig. 1a
and Supplementary Discussions 3, 4). Under these conditions, the measured
ES1 CSs would represent a population-weighted average of the
two tautomeric states (Methods). We find that all ES1 CSs (dG-N1, dT-N3,
dG-C8 and dT-C6) are in quantitative agreement with values predicted
by density functional theory (DFT) calculations for a weighted
dGᵉⁿᵒˡ•dT (80%) ⇌ dG•dTᵉⁿᵒˡ (20%) equilibrium (Figs 2c and 3a).
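
The population-weighted averaging invoked here is simple arithmetic, sketched below. The per-species chemical shift differences in the example are purely hypothetical placeholders, not the DFT values from the paper; only the 80:20 weighting is taken from the text.

```python
def weighted_shift(shifts, weights):
    """Population-weighted average chemical shift under fast exchange.

    shifts:  chemical shift (or shift difference) of each species, p.p.m.
    weights: relative populations of the same species (need not sum to 1).
    """
    if len(shifts) != len(weights):
        raise ValueError("shifts and weights must have the same length")
    total = sum(weights)
    if total <= 0:
        raise ValueError("weights must sum to a positive value")
    return sum(s * w for s, w in zip(shifts, weights)) / total


# Hypothetical shift differences for one nucleus in the two tautomers.
species_shifts = [50.0, 10.0]   # placeholder values, p.p.m.

observed_80_20 = weighted_shift(species_shifts, [0.8, 0.2])
observed_60_40 = weighted_shift(species_shifts, [0.6, 0.4])
```

Changing the weights moves the averaged shift linearly between the two species' values, which is why a shift in the tautomeric equilibrium shows up directly in the measured excited-state CSs.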


Transient WC-like dG•dT anionic mispair

Interestingly, upon increasing the pH to 8.4, we observed evidence for
a second excited state (ES2), which is seen as a second peak in the off-resonance
RD profile of dT-N3 (Fig. 1d). Global fitting of these RD
data (Extended Data Figs 2 and 3) revealed two excited states (ES1 and
ES2) that are most probably arranged in a linear topology (ES1 ⇌ dG•dT ⇌ ES2).

Compared to ES1, ES2 (ΔG‡ = 16.4 and ΔG = 4.59 kcal mol⁻¹) has
a considerably lower population (pES2 ≈ 0.04%) and lifetime (τES2 ≈
70 µs) at pH 8.4 (Fig. 2a). The dG-N1 and dT-N3 ES2 CSs are not only
‘swapped’ relative to ES1 such that dT-N3 experiences the larger downfield
shift (ΔωN3 = +56 p.p.m.) while dG-N1 experiences a smaller downfield
shift (ΔωN1 = +9 p.p.m.) (Fig. 2b), they are also more asymmetric
in favour of a deprotonated dT species. In addition, unlike pES1, pES2
increases considerably with pH, consistent with ionization and the formation
of dG•dT⁻ (Fig. 3a). The ES2 CSs are in excellent agreement
with values computed using DFT assuming a WC-like dG•dT⁻ (100%)
species (Fig. 3a, b). However, we cannot rule out that dG•dT⁻ is in rapid
equilibrium with a WC-like dG⁻•dT or an inverted wobble (iWB) geometry
(Extended Data Fig. 4 and Supplementary Discussion 1) that
falls outside detection limits (Fig. 3a).
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Figure 3 | Mutate-and-CS fingerprinting ES1 and ES2. a, Multi-state 
equilibrium between WB and WC-like dGedT mispairs. ES1/ES2 populations 
and weights are shown (25 °C and pH 6.9). pgs estimated based on the 
observed apparent pK,. b, CS fingerprinting dGedT ES1/ES2 using chemical 
modifications and structure-based DFT predictions of CSs. ES1 DFT CSs 


Fingerprinting the dGedT excited states 

We adapted a mutate-and-C$ fingerprint strategy**”* to test the pro- 
posed ES1 and ES2 (Fig. 3a); chemical modifications are used to trap 
an ES, or induce specific perturbations to the GS=ES equilibrium. We 
trapped ES1 (dG°"'*dT) using the mutagenic base 06-methyl-2'- 
deoxyguanosine (™°dG) (Fig. 3c and Extended Data Fig. 7), which is 
known to adopta distorted WC dG*""'*dT-like mispair'*”. Relative to 
the WB, this modification resulted in negligible changes in dG-Cl’ 
(Amey = —0.1 p.p.m.) and dT-C1’ (Awe, = +0.7 p.p.m.) CSs, anda 
small downfield shift in dG-C8 (Amcg = +1 p.p.m.), consistent with the 
RD-derived ES1 CSs (Fig. 3b and Extended Data Fig. 7). The modifi- 
cation induced a small upfield shift in dT-C6 (Amc. = —0.5 p.p.m.) that 
is inconsistent with the downfield dT-C6 CS (Fig. 3b) observed by RD. 
However, such a deviation is expected on the basis of DFT calculations 
(Aw@cs = —2.3 p.p.m. for dT-Cé6 in the ™°dGeT pair) (Supplementary 
Discussion 5) and can be attributed to minor deviations from an ideal 
dG°"'edT WC-like mispair geometry (Fig. 3c)!"°. Severe line broad- 
ening did not permit measurement of the dT-N3 CSs in these non- 
isotopically enriched samples. 

To test the proposed dG•dT− ES2, we measured the difference in dT-N3 CS between neutral and anionic dTTP (ΔωN3 = +55 p.p.m.) and found it to be in excellent agreement with the dT-N3 CS difference measured by RD (ΔωN3 = +56 p.p.m.) (Fig. 3b and Extended Data Fig. 5). In addition, we used the mutagenic thymidine analogue 5-bromo-2′-deoxyuridine (5BrdU) to push the equilibrium towards dG•5BrdU− (Fig. 3d). This modification lowers the pKa of 5BrdU-N3 (~8.6) and favours a WC-like dG•5BrdU− geometry at high pH". It increased the population of ES2 (ΔG‡ = 15.1 and ΔG = 3.37 kcal mol−1) by over two orders of magnitude at the expense of ES1 (ΔG‡ = 16.0 and ΔG = 4.67 kcal mol−1) while minimally affecting the ES1 and ES2 CSs (Fig. 3b). The consistency in ES2 CSs between the dG•dT− and dG•5BrdU− mispairs further supports a dominant WC-like ES2, rather than an iWB bp, in naked DNA. The unaffected ES1 CSs indicate that 5BrdU does not appreciably impact the dG^enol•5BrdU⇌dG•5BrdU^enol equilibrium (Fig. 3d) relative to dG•dT (Fig. 3a), consistent with prior computational studies*’”**.


are given for 80:20 dG^enol•dT:dG•dT^enol weighting. Error bars reflect experimental uncertainty (one s.d.). c, m6dG•dT structure!?*. d, dG•5BrdU− ES2 stabilized relative to dG•dT− ES2. Populations and weights (10 °C and pH 6.9) are shown.


Transient WC-like rG•rU mispairs in RNA
If the observed ESs correspond to WC-like dG•dT mispairs, it is predicted that similar ESs should arise in rG•rU mispairs in RNA, where WC base pairs are also readily accommodated within the A-form helix. To test this hypothesis, we carried out analogous pH- and temperature-dependent RD measurements on two RNA duplexes. RD profiles measured for rG•rU in A-form RNA (Fig. 4a and Extended Data Fig. 8) are very similar to those measured for dG•dT in B-form DNA (Fig. 1c, d). Global analyses of the RD data (Supplementary Table 1) revealed an apparent three-state exchange process at pH = 7.9 (Extended Data Fig. 3). The RD-derived CSs (Fig. 4b and Extended Data Fig. 9), together with the pH and temperature dependence of the populations and lifetimes (Fig. 4c), are consistent with rG•rU^enol⇌rG^enol•rU as ES1 and rG•rU− as ES2 (Fig. 4d), as observed in DNA. The rG•rU ES1 forward free energy barrier (ΔG‡ = 15.7 and ΔG = 3.86 kcal mol−1) is consistent with barriers measured for dG•dT ES1 (16.4 kcal mol−1) and computationally predicted barriers for G•U tautomerization (17.1 kcal mol−1)*?.

Computational studies* show that dG•dU^enol is stabilized relative to dG•dT^enol. This is predicted to tilt the rapid rG^enol•rU⇌rG•rU^enol equilibrium in favour of rG•rU^enol (40%) in RNA as compared to dG•dT^enol (20%) in DNA (Methods). We find that in RNA, the ES1 rU-N3 CSs are slightly more downfield shifted (ΔωN3 = +30 p.p.m.) than rG-N1 (ΔωN1 = +26 p.p.m.) (Fig. 4b). Reweighting the DFT-predicted CSs assuming a 60:40 ratio of rG^enol•rU:rG•rU^enol gives an excellent fit to the RD-derived values (Fig. 4b), and is in better agreement than an 80:20 rG^enol•rU:rG•rU^enol weighting (Extended Data Fig. 9). Changes observed in the ES1 CSs at higher pH suggest a potentially more complex exchange process (Supplementary Discussion 6). As with dG•dT−, the WC-like rG•rU− may exist in equilibrium with a WC-like rG−•rU and/or an iWB rG•rU− (Fig. 4d).

rG•rU wobbles are widespread in RNA, where they play important structural and functional roles*”. We therefore examined whether the ESs observed here would occur in more complex RNA structural contexts. Indeed, the rU-N3 15N RD profiles measured for two WB rG•rU mispairs in a 69-nucleotide Bacillus subtilis guanine riboswitch (Extended Data Fig. 8) provide initial evidence (ΔωN3 = +44–47 p.p.m.


19 MARCH 2015 | VOL 519 | NATURE | 317 


©2015 Macmillan Publishers Limited. All rights reserved 




[Figure 4 image, panels a–d. a, hp-GU-20 RNA duplex with RD profiles for guanine N1 and uracil N3 plotted against offset Ω 2π−1 (kHz) at several spinlock powers (Hz). b, ES1 and ES2 CSs for rG•rU and dG•dT. c, Populations and lifetimes versus temperature (°C) and pH. d, WC rG•rU enol ES1 (pES1 = 0.06%; weights ~40% and ~60%), WB rG•rU GS (>99%) and WC rG•rU− ES2 (pES2 = 0.16%).]


Figure 4 | Transient tautomeric and anionic WC-like mispairs in A-form RNA. a, RNA duplex and R1ρ RD profiles with three-state global fits to rG-N1 and rU-N3 (20 °C and pH 7.9). Error bars represent experimental uncertainty (one s.d.). χ² = 0.66, R² = 0.99, and P < 0.0001. b, CSs for rG•rU ES1/ES2 compared to structure-based DFT predictions and rUTP ionization. Error bars


and pES ≈ 0.04% at pH 7.9) for WC-like rG•rU− mispairs in more complex RNA structures (Extended Data Fig. 9 and Supplementary Discussion 7). Therefore, we expect that transient WC-like rG•rU mispairs exist robustly across the RNA transcriptome.


Biological implications 
Initial selection of NTPs during replication, and tRNAs during trans- 
lation, strongly relies on WC stereochemical geometry as a means of 
discriminating against mispairs***’. The low error rate (~10 *to 10°) 
during initial selection accounts for most of the overall fidelity of rep- 
lication (~10~° to 10” '°)*°” and translation (~107* to 10°). By 
stereochemically mimicking the WC geometry, the ES WC-like dG•dT and rG•rU mispairs observed here can provide a mechanism for evading WC fidelity checks during initial substrate selection'*’°. The intrinsic probabilities with which WC-like mispairs form have long been suspected as important determinants of substitution mutation probability*"°. By carrying out the first measurements of the intrinsic probabilities with which WC-like dG•dT and rG•rU mispairs form in native DNA/RNA systems, we are able to obtain unique insights into the mechanisms of misincorporation and the potential roles of ES1/ES2.
We find that the probabilities with which WC-like dG•dT ES1 and ES2 form in duplex DNA (10° * to 10°) span the dGTP•dT/dG•dTTP misincorporation and base substitution probabilities seen during replication using high-fidelity polymerases with little/no proofreading capabilities (Fig. 5a, Extended Data Fig. 10 and Supplementary Discussion 8)*°“**. Similarly, the WC-like rG•rU ES1 and ES2 probabilities (10 * to 10° *) span the majority of amino acid misincorporation probabilities arising specifically due to rG•rU pairing at any codon position (10 *to 10 °)34 (Fig. 5b and Supplementary Discussion 8). Note that some of the amino acid misincorporation probabilities (10°) are lower than the measured rG•rU ES2 probability, which could be due to translational proofreading* and/or lower pH conditions that destabilize ES2. These results, together with previous structural studies showing that WB and WC-like mispairs can exist within polymerase’*® and ribosome**"* active sites, strongly suggest that energetic competition between WB and WC-like mispairs is robust and is a key determinant of misincorporation probability during replication and translation (Supplementary Discussion 9). However, additional studies are needed to determine the probabilities with which WC-like mispairs form within the unique environment of polymerases and ribosomes. A recent MD study found that rG^enol•rU tautomers can be stabilized in a ribosome


[Figure 4d structure labels: WC rG−•rU; iWB rG•rU−]


reflect experimental uncertainty (one s.d.). c, Population and lifetime of rG•rU ES1/ES2 measured as a function of temperature (at pH 6.9) and pH (at 20 °C). dG•dT ES1 and ES2 shown (in grey) for comparison. Error bars reflect experimental uncertainty (one s.d.). d, Multi-state equilibrium between WB and WC-like rG•rU mispairs (20 °C and pH 7.9).
Figure 5 | Correlation between WC-like bp and misincorporation probabilities. a, dG•dT ES1 (blue square) and ES2 (green triangle) probabilities; error bars reflect experimental uncertainty (one s.d.). pH-dependent dGTP•dT*/dG•dTTP* misincorporation probabilities (errors as published)*”. Averaged dGTP•dT/dG•dTTP misincorporation and base substitution probabilities; error bars reflect the s.d. (Supplementary Discussion 8). ES2 fit to Henderson–Hasselbalch equation shown as green line (see Methods). A blue line connects ES1 points for visualization purposes. b, rG•rU ES1 (blue square, n = 9) and ES2 (green triangle, n = 5) probabilities measured at varied conditions (Supplementary Table 1), and amino acid misincorporation probabilities (red ‘X’, n = 104) due to rG•rU mispairs as given in ref. 44. Error bars in ES1/ES2 reflect experimental uncertainty (one s.d.). c, The probabilities of dGTP•dT/dG•dTTP misincorporation (error as given) by avian myeloblastosis virus RT (ref. 47) versus the pKa-predicted probability of forming a WC-like dG•dT mispair as a function of pH; error bars reflect experimental uncertainty (one s.d.). dGTP•dT R² = 0.99 and P = 0.0001. dG•dTTP R² = 0.94 and P = 0.004.
context, but challenges their involvement in decoding errors based on predicted tRNA binding energies”.

The pH-dependent dG•dT misincorporation probability” points to the involvement of an anionic species in misincorporation’*’. Our results strongly suggest that this species is most likely to be ES2 dG•dT− and not the energetically disfavoured dG−•dT. We observe excellent agreement between the pKa-predicted probability of forming ES2 dG•dT− and pH-dependent dGTP•dT/dG•dTTP misincorporation probabilities measured for a reverse transcriptase” that lacks any proofreading ability (Fig. 5c and Extended Data Fig. 10). We note that the correlation is reduced at more extreme pH, near the pKas of other ionizable groups in proteins and DNAs (Extended Data Fig. 10 and Supplementary Discussion 10). We also find that dG•5BrdU− enhances the probability of forming a WC-like ES2 (Fig. 3d) and correspondingly results in an approximately eightfold increase in dG•5BrdU misincorporation”. These data indicate that, for this polymerase, misincorporation proceeds predominantly via a WC-like dG•dT as proposed”. It is very likely that parameters such as polymerase type, DNA sequence, and the chemical environment can affect the relative stabilities and lifetimes of the anionic, tautomeric, and WB mispairs*. Therefore we expect this to affect the flux through distinct misincorporation pathways involving different WC and WB conformations, which may help to explain the broad range of misincorporation probabilities.

Our findings suggest that unconventional enol tautomeric and anionic bases exist robustly in genomes. We anticipate that these rare tautomeric and anionic bases play unique roles in DNA damage induction and repair, nucleic acid recognition, chemical modifications of nucleic acids, and catalysis. The NMR methods outlined here can immediately be applied to characterize tautomeric and anionic species, which we believe will not be restricted to dG•dT and rG•rU mispairs, but rather will be widespread across diverse nucleic acid motifs.


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS

Sample preparation

NMR buffer. All duplex DNA and RNA samples were buffer-exchanged using a centrifugal concentrator (EMD Millipore) into a solution containing 25 mM sodium chloride (100 mM sodium chloride for Dickerson-GT DNA), 15 mM sodium phosphate, 0.1 mM EDTA, and 10% D2O with variable pH (6.0, 6.4, 6.8, 6.9, 7.9). pH values of 8.4 were obtained for hp-GT DNA and hp-GU-20 RNA samples by direct titration of pH 7.9 samples with filtered 0.5 M NaOH solution. The monovalent ion concentration subsequently increased by a small amount proportional to the NaOH titrated in but did not affect DNA/RNA conformation, as confirmed by NMR. Natural isotopic abundance oligonucleotide sample concentrations ranged from 2 to 3.5 mM. 13C/15N-labelled oligonucleotide sample concentrations ranged from 0.7 to 3.5 mM. The xpt-G riboswitch sample was diluted to ~30 μM in a solution containing saturated guanine, denatured, and annealed on ice. The sample was buffer-exchanged against either potassium acetate (pH 6.7) or sodium phosphate (pH 7.9) buffer and concentrated to 0.7–1.7 mM. Mg2+ was titrated in until the total concentration was ~5 mM.

Site-specifically 13C/15N-labelled DNA samples. Selectively 13C/15N-labelled DNA samples (hp-GT DNA, 5BrU5-hp-GT and m6G15-hp-GT DNA) were purchased from the Yale Keck Oligonucleotide Synthesis Facility and were synthesized using commercially available 2′-deoxyguanosine DMT-phosphoramidite (98% 13C10, 98% 15N5) and 2′-deoxythymidine phosphoramidite (98% 13C10, 98% 15N2) purchased from Cambridge Isotope Labs. hp-GT DNA was selectively 13C/15N-labelled at dT5 and dG15, 5BrU5-hp-GT DNA was 13C/15N-labelled at dG15, and m6G15-hp-GT was 13C/15N-labelled at dT5. Samples were purified using RP-HPLC before buffer exchange. NMR experiments were used to confirm native folding of the hp-GT, 5BrU5-hp-GT, and m6G15-hp-GT DNA constructs.

Enzymatic synthesis of 13C/15N-labelled DNA samples. The Dickerson-GT DNA sample was synthesized in vitro using uniformly enriched 13C/15N dGTP and dTTP (Silantes GmbH) as previously described". The reaction mixture was centrifuged and filtered to remove excess pyrophosphate and concentrated down to 1 ml in a 3 kDa molecular weight cut-off centrifugal concentrator (EMD Millipore). The sample was mixed with 1 ml of a formamide-based denaturing loading dye, denatured at 95 °C for 5 min, loaded onto a 33 × 102 cm sequencing gel (20% polyacrylamide/8 M urea) and run for 12 h to resolve the target oligonucleotide from template and other nucleic acid species. The target band was shadowed briefly using a UV hand-lamp and excised before gel electroelution (Whatman, GE Healthcare), followed by ethanol precipitation. Sample purity was confirmed using gel electrophoresis (20% polyacrylamide/8 M urea) stained with SybrGOLD before buffer exchange.
Enzymatic synthesis of 13C/15N-labelled RNA samples. The hp-GU-20, hp-GU-24 and xpt-G riboswitch RNA samples were prepared using in vitro transcription as previously described’> using uniformly enriched 13C/15N ribonucleotide triphosphates (hp-GU-20 RNA: rGTP and rUTP only; hp-GU-24: all; xpt-G: all). Purification was carried out as described above for the 13C/15N-labelled DNA construct.
Unlabelled and unlabelled-modified DNA samples. hp-GT and Dickerson-GT DNA constructs at natural isotopic abundance were purchased from Integrated DNA Technologies. The O6-methyl-2′-deoxyguanosine mismatch constructs (m6G15-hp-GT and m6G4-Dickerson-GT DNA) were purchased from the Yale Keck Oligonucleotide Synthesis Facility. hp-GT and Dickerson-GT DNA constructs were desalted before buffer exchange. Unlabelled-modified samples were purified using RP-HPLC before buffer exchange.

Isotopically enriched dNTP and rNTP samples. Uniformly 13C/15N-enriched dGTP, dTTP, rGTP and rUTP samples were purchased (Silantes GmbH) and added to an NMR buffer (25 mM sodium chloride, 15 mM sodium phosphate, 0.1 mM EDTA and 10% D2O at pH 6.9). The sample was adjusted to pH ~ 12.5 directly using 5 M NaOH.

NMR experiments. Resonance assignment. The CS assignments for DNA and RNA constructs were obtained using aromatic [13C, 1H], aliphatic [13C, 1H], imino [15N, 1H] heteronuclear and [1H, 1H] nuclear Overhauser effect spectroscopy (NOESY) homonuclear correlation experiments. The data for all DNA constructs were acquired on an 18.8 T Agilent spectrometer equipped with a triple-resonance HCN cryogenic probe, for the uniformly 13C/15N-labelled hp-GU-24 RNA construct on a 14.1 T Bruker Avance spectrometer equipped with a triple-resonance HCN cryogenic probe, and for the xpt-G RNA riboswitch on a 14.1 T Agilent spectrometer equipped with a Bruker HCPN cryogenic probe. All data were processed and analysed using the software NMRpipe”’ and SPARKY (T. D. Goddard and D. G. Kneller, SPARKY 3, University of California, San Francisco), respectively. Resonance assignment for exchangeable and non-exchangeable 1H was performed using the 2D [1H, 1H] WATERGATE NOESY experiment” (mixing time 250 ms) as described previously*”™, and their covalently bonded 13C/15N nuclei were assigned using heteronuclear single/multiple quantum coherence correlation experiments (HSQC or HMQC). For the labelled RNA constructs conventional HSQC experiments” were acquired for all spins, while for unlabelled DNA constructs conventional HSQC was used for the aliphatic C1′ spins and band-selective optimized flip angle short-transient (SOFAST) HMQC**** was employed for the imino and aromatic spins.

15N R1ρ relaxation dispersion. 1D 15N R1ρ RD experiments” targeting imino nitrogen resonances of interest were carried out at 14.1 T (hp-GT, 5BrU5-hp-GT, Dickerson-GT DNA and hp-GU-24 RNA) and 16.4 T (hp-GU-20 RNA) as previously described’’. Raw data were processed using NMRpipe™ to generate a series of peak intensities. On- and off-resonance R1ρ RD profiles were recorded using spinlock powers (ωSL 2π−1) ranging from 100 to 2,000 Hz, with the absolute offset frequencies (Ω 2π−1, Hz) ranging from 0 to 3.5× the applied spinlock power (Supplementary Table 1). Offset frequencies greater than 3.5× the given spinlock power were not used owing to substantial R1 relaxation contributions”’. Magnetization of the spins of interest was allowed to relax under an applied spinlock for the following durations: 0–120 ms for N1/N3 of hp-GT, 5BrU5-hp-GT, Dickerson-GT DNA and hp-GU-24 RNA; 0–100 ms for N1/N3 of hp-GU-20 RNA; and 0–80 ms, 0–74 ms, 0–68 ms for N3 of the xpt-G riboswitch.

13C R1ρ relaxation dispersion. 13C R1ρ RD experiments targeting carbon resonances of interest were carried out at 14.1 T as previously described**”’. On- and off-resonance R1ρ RD profiles were measured using spinlock powers (ωSL) ranging from 150 to 3,500 Hz, with the absolute offset frequencies (Ω 2π−1, Hz) ranging from 0 to 3.5× the applied spinlock power (Supplementary Table 1). Magnetization of the spins of interest was allowed to relax under an applied spinlock for the following durations: 0–60 ms for C1′/C6/C8 of hp-GT DNA and C1′ of Dickerson-GT, 0–50 ms for C6/C8 of Dickerson-GT DNA.

13C/15N CSs of ionized dNTPs and rNTPs. Carbon and nitrogen CSs of neutral and deprotonated dNTPs and rNTPs were collected at 25 °C and pH ~ 6.9 and ~ 12.5 using a 2D [15N, 13C] HMQC experiment on a 14.1 T Agilent spectrometer equipped with a Bruker HNCP cryoprobe.
Analysis of R1ρ data. Fitting of 13C and 15N R1ρ data. R1ρ values were calculated by fitting the decay of peak intensity versus relaxation delay to a monoexponential”. Errors in R1ρ were estimated using spectral noise and duplicate R1ρ data points using a suite of Mathematica notebooks”. Measured on- and off-resonance R1ρ data were globally fit to algebraic equations describing N-site chemical exchange using a Levenberg-Marquardt method, weighted to the experimental error in the R1ρ data. For two-state exchange, data were fit to the Laguerre equation (1)°° under the valid assumption that the populations are highly asymmetric, such that pGS >> pES where pES < 0.3. For the three-state chemical exchange model where kBC = kCB = 0, data were fit to both the three-state Laguerre equation (2) as well as the general three-state equation (3)”, where pGS > pES and pES < 0.1. Fitted parameters derived from equation (2) and equation (3) are in excellent agreement with one another (Supplementary Table 1). Statistical tests, F-test and Akaike information criterion (AIC), were used to select the best-fit N-state exchange model**” (Extended Data Fig. 3). 13C and 15N RD data from the dG•dT or rG•rU mispair resonances for each construct, at each temperature and pH condition, were fitted globally (where kex and pES are the shared parameters) when possible. In the cases where 15N RD data were available but 13C RD data showed no chemical exchange, did not exhibit three-state exchange, or were not collected, the 15N N1/N3 RD data were globally fitted as described above. Bloch-McConnell® (B-M) numerical simulations were used to validate the algebraic approximations for two- and three-state exchange scenarios.
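
As an illustration of the first step described above (extracting R1ρ from the decay of peak intensity with relaxation delay), the following minimal Python sketch fits a monoexponential to a series of intensities. It is not the Mathematica workflow referred to in the text: it assumes positive, low-noise intensities and uses an unweighted log-linear least-squares fit, and the function name `fit_r1rho` and the example values are hypothetical.

```python
import math


def fit_r1rho(delays_s, intensities):
    """Fit I(t) = I0 * exp(-R1rho * t) by linear regression on ln(I).

    Assumes all intensities are positive. Returns (R1rho in s^-1, I0).
    """
    xs = list(delays_s)
    ys = [math.log(value) for value in intensities]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return -slope, math.exp(intercept)


# Hypothetical, noise-free decay sampled between 0 and 120 ms.
delays = [0.0, 0.02, 0.04, 0.08, 0.12]
peaks = [1000.0 * math.exp(-20.0 * t) for t in delays]
rate, amplitude = fit_r1rho(delays, peaks)
```

A weighted nonlinear fit, as used for the measured data, would handle noisy points near the baseline more gracefully than this log-linear shortcut.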
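Two-state Laguerre equation”:
Three-state Laguerre equation with no minor exchange”:
in which $R_1$ and $R_2$ are the intrinsic longitudinal and transverse relaxation rates (s$^{-1}$). The exchange rates are defined as $k_{\mathrm{ex},i} = k_{\mathrm{GS}\to\mathrm{ES}i} + k_{\mathrm{ES}i\to\mathrm{GS}}$, where $k_{\mathrm{GS}\to\mathrm{ES}i} = p_{\mathrm{ES}i}\,k_{\mathrm{ex},i}$ and $k_{\mathrm{ES}i\to\mathrm{GS}} = p_{\mathrm{GS}}\,k_{\mathrm{ex},i}$, and where $i$ = 1 or 2. The CS difference between the GS and ESs is given by $\Delta\omega_{\mathrm{ES}i} = \Omega_{\mathrm{ES}i} - \Omega_{\mathrm{GS}}$, where $\Omega = \Omega_{\mathrm{obs}} - \omega_{\mathrm{rf}}$ defines the resonance offset from the carrier frequency ($\omega_{\mathrm{rf}}$), $\Omega_{\mathrm{obs}} = \Omega_{\mathrm{GS}}p_{\mathrm{GS}} + \Omega_{\mathrm{ES1}}p_{\mathrm{ES1}}$ or $\Omega_{\mathrm{obs}} = \Omega_{\mathrm{GS}}p_{\mathrm{GS}} + \Omega_{\mathrm{ES1}}p_{\mathrm{ES1}} + \Omega_{\mathrm{ES2}}p_{\mathrm{ES2}}$, and where $\Omega_{\mathrm{obs}} \approx \Omega_{\mathrm{GS}}$ when $p_{\mathrm{GS}} \gg p_{\mathrm{ES}}$, as is the case in the ESs measured. The average effective spinlock field in the rotating frame is given by $\omega_{\mathrm{eff}}^2 = \Omega^2 + \omega_{\mathrm{SL}}^2$, with $\omega_{\mathrm{GS}}^2 = (\Omega_{\mathrm{GS}} - \omega_{\mathrm{rf}})^2 + \omega_{\mathrm{SL}}^2$ and $\omega_{\mathrm{ES}i}^2 = (\Omega_{\mathrm{ES}i} - \omega_{\mathrm{rf}})^2 + \omega_{\mathrm{SL}}^2$, and $\omega_{\mathrm{SL}}$ is the spinlock power. The tilt angle in the rotating frame is given by $\theta = \arctan(\omega_{\mathrm{SL}}/\Omega)$.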
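
The definitions above translate directly into a few lines of arithmetic. The sketch below is illustrative only: the function names and numerical inputs are hypothetical, offsets and spinlock powers are assumed to be supplied in Hz and converted to rad s−1, and `atan2` is used so that the on-resonance case (Ω = 0) returns θ = π/2 rather than dividing by zero.

```python
import math


def exchange_rates(k_ex, p_gs, p_es):
    """Forward (GS -> ES) and reverse (ES -> GS) rates from k_ex and populations."""
    k_forward = p_es * k_ex
    k_reverse = p_gs * k_ex
    return k_forward, k_reverse


def effective_field(offset_hz, spinlock_hz):
    """Effective field (rad/s) and tilt angle (rad) in the rotating frame.

    offset_hz is Omega/2pi and spinlock_hz is omega_SL/2pi.
    """
    big_omega = 2.0 * math.pi * offset_hz
    omega_sl = 2.0 * math.pi * spinlock_hz
    omega_eff = math.hypot(big_omega, omega_sl)  # sqrt(Omega^2 + omega_SL^2)
    theta = math.atan2(omega_sl, big_omega)      # arctan(omega_SL / Omega)
    return omega_eff, theta


# Hypothetical values: k_ex = 3000 s^-1 with a 0.1% excited state.
k_f, k_r = exchange_rates(3000.0, 0.999, 0.001)
on_res = effective_field(0.0, 1000.0)
off_res = effective_field(1000.0, 1000.0)
```

For negative offsets `atan2` returns an angle between π/2 and π, whereas a plain arctangent of the ratio would be negative; sin²θ, which scales the exchange contribution, is the same in both conventions.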

Analysis of the three-state exchange model. We repeated measurements of RD 
profiles for hp-GT dG-N1 and dT-N3 at pH 8.4 and 25 °C on a different spectro- 
meter and also obtained data that is consistent with three-state exchange (Extended 
Data Fig. 2 and Supplementary Table 1). In addition, we collected one additional 
spinlock power (800 Hz) for both resonances, and find that the inclusion of this 
extra spinlock power has negligible effect on the fitted exchange parameters indi- 
cating that they are robustly determined by the measured data. We note that differ- 
ences in the RD profiles and fitted parameters between spectrometers are largely 
within error, with minor differences likely arising due to small differences in tem- 
perature and/or spinlock calibrations. 

The dT-N3 resonance of hp-GT DNA at 25 °C and pH 7.9 was also individually refit assuming both two-state and three-state exchange. The individual dT-N3 three-state fit gave very similar fitted parameters as the dT-N3 in the three-state global fit with dG-N1. The F-test (at 0.05 significance level) favoured the three-state individual fit model over the two-state individual fit model. AIC also favoured the three-state model, estimating the three-state model to be 3.1 X 10° times more likely to be the correct model than the two-state model. The individual two-state and three-state fits to dG-N1 RD profiles give similar agreement (χ² = 0.83) when the ES2 CS is fixed based on the globally fitted value; however, statistical tests slightly favour the simpler model (AIC favours two-state by approximately twofold).
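
Statements of the form "model A is N times more likely than model B" follow from the difference in AIC between two fits. The sketch below shows the arithmetic under a stated assumption: for least-squares fits with known Gaussian errors, AIC is taken as χ² + 2k, where k is the number of fitted parameters. The exact AIC variant used for the fits reported here is not specified in this section, and the function names and numbers are hypothetical.

```python
import math


def aic_from_chi2(chi2, n_params):
    """AIC for a weighted least-squares fit, up to a model-independent constant."""
    return chi2 + 2.0 * n_params


def relative_likelihood(aic_a, aic_b):
    """How many times more likely model A is than model B, given their AICs."""
    return math.exp((aic_b - aic_a) / 2.0)


# Hypothetical fits: a three-state model with more parameters but lower chi2.
aic_two_state = aic_from_chi2(chi2=140.0, n_params=5)
aic_three_state = aic_from_chi2(chi2=110.0, n_params=8)
ratio = relative_likelihood(aic_three_state, aic_two_state)
```

The penalty term 2k is what allows the simpler model to win when the extra parameters buy only a marginal improvement in χ², as in the dG-N1 case above.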

The three-state model is supported by statistical tests (F-test and AIC favour the three-state over two-state exchange model) and also by B-M simulations (data not shown) for the hp-GT dG-N1 and dT-N3 data at pH 7.9. Here, R1ρ values were simulated, noise corrupted, and evaluated for the ability to report on the three-state exchange following the procedure reported in Bothe et al.®.

Bloch-McConnell three-state numerical simulations. Parameters derived from the algebraic three-state fits, along with the ωrf and ωSL, were used to simulate numerical solutions to the three-state B-M equations” (Extended Data Fig. 3). The B-M simulations were carried out using a procedure similar to that described recently for two-state exchange®’. Simulations were carried out assuming a 0.25 s relaxation delay under the applied ωSL.

Thermodynamic analysis of R1ρ RD-derived parameters. Temperature-dependent analysis of forward and reverse exchange rates in the site-labelled (dG15•dT5 13C/15N) hp-GT DNA and G/U-labelled hp-GU-20 RNA samples was carried out as previously described”. This analysis employed the ES populations and exchange rates obtained based on two-state global fitting of dG15-N1 and dT5-N3 RD data measured in hp-GT DNA at 10, 20, 25 and 30 °C. For hp-GU-20 RNA, the analysis employed populations and exchange rates obtained based on two-state and three-state global fitting of rG16-N1 and rU5-N3 RD data measured at 10, 20, 25 and 30 °C at pH 6.9. Errors in the fitted thermodynamic parameters are given by weighted fits of the modified van’t Hoff equation” to the RD-derived forward and reverse exchange rates and their errors. The NMR sample temperature was calibrated using 99.8% methanol-d4 (Cambridge Isotope Laboratories) using the equation $T = -16.7467(\Delta\delta)^2 - 52.5130(\Delta\delta) + 419.1381$, where $\Delta\delta$ is the difference in CS (p.p.m.) between the hydroxyl and methyl protons™.
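
The calibration polynomial can be evaluated directly. The following sketch simply encodes the coefficients quoted above; the function name is hypothetical, and the result is assumed to be in kelvin, which the text does not state explicitly but which is consistent with the magnitude of the constant term.

```python
def methanol_d4_temperature(delta_ppm):
    """Sample temperature (assumed kelvin) from the methanol-d4 shift difference.

    delta_ppm is the CS difference (p.p.m.) between the hydroxyl and methyl protons.
    """
    return -16.7467 * delta_ppm ** 2 - 52.5130 * delta_ppm + 419.1381


# Hypothetical reading of 1.5 p.p.m.
temperature = methanol_d4_temperature(1.5)
```

Because the polynomial decreases monotonically over the positive shift differences seen in practice, a larger hydroxyl–methyl separation corresponds to a colder sample.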

Density functional theory geometry optimizations and CS calculations. All DFT calculations® were performed using Gaussian 09c (Gaussian, Inc.)® and carried out on the University of Michigan’s Advanced Research Computing HPC cluster, Flux, as previously described** with minor modifications to the DFT method and basis set used. Geometry optimizations were carried out using the empirical exchange-correlation functional, M06-2X°, with the 6-31+G(d,p) basis set. The 13C and 15N isotropic magnetic shieldings (σ13C and σ15N) were calculated using the GIAO method with M06-2X/6-31+G(d,p). CSs of the nucleobases (δ13C and δ15N) were calculated by δ13C/15N = σ13C/15N − σTMS/NH3, where σTMS and σNH3 are the isotropic magnetic shieldings calculated for the reference compounds trimethylsilane (13C) and NH3 (15N), respectively.

Input structures for DFT calculations. We generated idealized B-/A-form helices corresponding to our sequence contexts (hp-GT DNA and hp-GU-24 RNA) using make-NA (J. Stroud, make-NA, http://structure.usc.edu/make-na/server.html; 2011). The duplexes were truncated to the trinucleotide step centred on the target mispair (GTG/CGC for hp-GT DNA and GUA/UGC for hp-GU-24 RNA). The sugar and phosphate moieties were removed and replaced with a methyl for the i (dG•dT or rG•rU mispair), i + 1, and i − 1 base pairs to save on computational time. Although the structures lack 2′-deoxyribose or ribose sugar moieties, they will be denoted as dG•dT or rG•rU to avoid confusion. All heavy atoms were frozen for the i + 1 and i − 1 base pairs while geometry optimizations were carried out for protons and heavy atoms of the central dG•dT or rG•rU base pair as well as the protons of the i + 1/i − 1 pairs. We performed full geometry optimizations on: dG•dT and rG•rU GS WB pairs, dG^enol•dT, dG•dT^enol, rG^enol•rU, rG•rU^enol, and dG−•dT and rG−•rU ES WC-like pairs. All converged to the expected WB or WC-like geometries. In the instance where the geometry optimizations of dG•dT− and rG•rU− starting states failed to converge to a stable WC-like dG•dT−/rG•rU− geometry (and instead converged to an iWB geometry, in vacuum), the WC-like dG•dT^enol/rG•rU^enol states were converted to dG•dT−/rG•rU− and geometry optimizations were carried out on the protons only. CSs for each state were calculated and later used in calculating population-weighted CSs assuming different populations of these mispair species. In addition, while the sugar moieties were truncated to methyl groups to save on computation time, it should be noted that previous DFT studies of nucleotides have shown that tautomerization, primarily of pyrimidines, can have an effect on the sugar conformation®”. However, we can rule out large changes in sugar pucker arising in ES1 based on the negligibly small chemical exchange contributions to both dG-C1′ and dT-C1′ (see Extended Data Fig. 2) and only very small changes in dG-C1′ and dT-C1′ CSs upon locking the enol-like form with m6dG•dT (|ΔωC1′| < 0.7 p.p.m., see Fig. 3b and Extended Data Fig. 7). Finally, while the CSs of the anionic pairs are predicted assuming planar pair geometry, prior computational studies of G•T− and G−•T nucleobase pairs in isolation have shown that they can favour non-planar and non-WC geometry”.

dG^enol•dT distance-dependent DFT calculations. We carried out distance-dependent DFT calculations® on a pair of WC-like dG^enol•dT N1/N9-methyl nucleobases in vacuum using the M06-2X method and 6-31+G(d,p) basis set, as described above. The geometry of a dG•dT wobble pair and a dG^enol•dT WC-like pair was optimized with no constraints before CS calculations. The ideal N1–N3 distance of the dG^enol•dT WC-like mispair was then manually varied from 2.44 Å to 3.8 Å in increments of 0.1 Å from 2.44–3.04 Å and then to 3.8 Å (Extended Data Fig. 4). At every increment the proton positions alone were optimized and CSs were calculated relative to an optimized dG•dT wobble base pair.

Population-weighted average DFT-predicted CS calculations. Based on the computationally predicted energetic differences between interconverting dG^enol•dT and dG•dT^enol base mispairs in water (ΔG = 0.7–0.8 kcal mol−1)! and in a weakly polar medium (ΔG = 0.99 kcal mol−1)®, we can predict that the dG•dT ES1 CSs represent a population-weighted average between interconverting dG^enol•dT (~80%) ⇌ dG•dT^enol (~20%) states. Thus, the DFT-predicted CSs for dG^enol•dT and dG•dT^enol were summed in a population-weighted manner. It is noted that the computationally predicted energetic stabilities of the tautomeric states differ when calculated in water versus vacuum, or a weakly polar medium, with the values predicted in water giving the greatest agreement with our experimental results.
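
The ~80:20 weighting follows from a two-state Boltzmann relation applied to the quoted free-energy difference, and the weighted CS is then a simple average. The sketch below illustrates both steps; it assumes a temperature of 298.15 K (the temperature underlying the computed ΔG values is not given in this section), and the function names and the two example CS values are hypothetical.

```python
import math

R_KCAL = 0.0019872  # gas constant in kcal mol^-1 K^-1


def minor_fraction(delta_g_kcal, temperature_k=298.15):
    """Fractional population of the less stable state in a two-state equilibrium."""
    ratio = math.exp(-delta_g_kcal / (R_KCAL * temperature_k))
    return ratio / (1.0 + ratio)


def weighted_shift(shifts_ppm, weights):
    """Population-weighted average of per-state chemical shifts."""
    total = sum(weights)
    return sum(s * w for s, w in zip(shifts_ppm, weights)) / total


# A free-energy difference of 0.8 kcal mol^-1 gives a minor population near 20%.
p_minor = minor_fraction(0.8)
# Hypothetical per-state shifts (p.p.m.) averaged with an 80:20 weighting.
average = weighted_shift([10.0, 20.0], [0.8, 0.2])
```

Repeating the average with 60:40 weights is how the alternative RNA weighting discussed in the main text would be compared against the RD-derived values.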

In the case of rG•rU ES1, computational studies have shown that dG•dU^enol is ~1 kcal mol−1 more stable than dG•dT^enol in a DNA fragment*, suggesting that an rG^enol•rU⇌rG•rU^enol equilibrium should be tilted slightly more towards rG•rU^enol than dG•dT^enol in DNA. We can qualitatively estimate the relative stability between rG^enol•rU and rG•rU^enol to be 60:40 based on a best fit to the RD-derived CSs.
pKa fitting and probability estimation. The apparent pKas for hp-GT ES2, 5BrU-hp-GT ES2, and dGTP•dT misincorporation (pH 6.5–8.6) were fit to the Henderson–Hasselbalch equation using a Monte-Carlo (MC) approach. Here, 10° pES values at pH 7.9 and/or 8.4 were selected from a Gaussian distribution with mean pES value and standard deviation representing the uncertainty in pES based on fitting of the RD data. 10° fits to equation (4) were then carried out assuming these pES values to generate 10° pKas.

$$p = \frac{10^{\mathrm{pH} - \mathrm{p}K_\mathrm{a}}}{1 + 10^{\mathrm{pH} - \mathrm{p}K_\mathrm{a}}} \qquad (4)$$

where p is the probability of forming ES2 or the dGTP•dT misincorporation probability at a given pH. The resulting fitted pKa values were fitted to a Gaussian distribution. The mean value of the Gaussian distribution is the reported pKa value and the standard deviation is assumed to be the error. An analogous approach was used to back-calculate predicted pES at a given pH using the pKa derived by the above method.
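
The Monte-Carlo procedure can be sketched in a few lines. This is a simplified illustration rather than the fitting code used here: it assumes a single pH point, for which fitting equation (4) reduces to an algebraic inversion, and the function names, number of draws, seed and input values are all hypothetical.

```python
import math
import random
import statistics


def hh_probability(ph, pka):
    """Equation (4): probability of the deprotonated state at a given pH."""
    x = 10.0 ** (ph - pka)
    return x / (1.0 + x)


def pka_from_probability(ph, p):
    """Invert equation (4) for a single (pH, p) measurement."""
    return ph - math.log10(p / (1.0 - p))


def monte_carlo_pka(ph, p_mean, p_sd, n_draws=10000, seed=0):
    """Propagate the uncertainty in p into a pKa mean and standard deviation."""
    rng = random.Random(seed)
    pkas = []
    for _ in range(n_draws):
        p = rng.gauss(p_mean, p_sd)
        if 0.0 < p < 1.0:  # discard unphysical draws
            pkas.append(pka_from_probability(ph, p))
    return statistics.mean(pkas), statistics.stdev(pkas)


# Hypothetical input: p = 0.2% +/- 0.02% measured at pH 7.9.
pka_point = pka_from_probability(7.9, 0.002)
pka_mean, pka_sd = monte_carlo_pka(7.9, 0.002, 0.0002, n_draws=5000, seed=1)
```

With data at two pH values, each draw would instead require a one-parameter least-squares fit of equation (4), but the propagation of error through repeated sampling is the same.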
Extended Data Figure 1 | NMR spectra of site- and selectively-labelled dG•dT mispair DNA constructs. a, b, Shown are the hp-GT DNA (a) and Dickerson-GT (b) constructs with 13C/15N-labelled dG•dT mispairs highlighted in red along with 2D imino [15N, 1H] HSQC, 2D aromatic [13C, 1H] HSQC and 2D C1′ [13C, 1H] HSQC spectra (pH 6.9, 25 °C).
Extended Data Figure 2 | Rotating frame relaxation dispersion profiles of dG•dT mispairs in hp-GT and Dickerson-GT DNA constructs. RD profiles showing chemical exchange (R2 + Rex) in the dG•dT mispair as a function of the spin lock offset (Ωeff 2π−1) and spin lock power (ωSL 2π−1, colour coded in insets). a, b, Shown are 15N (a) and 13C (b) RD profiles in hp-GT DNA. On-resonance profiles showing solid and dashed black lines indicate fits assuming no chemical exchange (solid) and simplified two-state exchange process (dash). The hp-GT dG15-N1 and dT5-N3 in brackets denote duplicate profiles (with an additional 800 Hz spinlock power for each) collected at pH 8.4 and 25 °C on a different spectrometer from the preceding profiles. c, 15N and 13C RD profiles for Dickerson-GT. Sample conditions are indicated on each profile. Error bars represent experimental uncertainty (one s.d., see Methods).
Extended Data Figure 3 | Multiple site exchange comparison and numerical
solutions. a, b, Global fitting of hp-GT DNA (a) and hp-GU-20 RNA (b) N1
and N3 RD profiles to two-state algebraic equation (equation (1), fit reduced χ²
shown in inset) and three-state algebraic equation (equation (2), fit reduced χ²
shown in inset). Numerical solutions to the Bloch-McConnell three-state
equations assuming no minor exchange and input exchange parameters
obtained based on the three-state algebraic fit are also shown to establish
the validity of the three-state expression under these exchange scenarios
(equation (2), see Methods). Sample conditions are indicated on each profile.
Error bars represent experimental uncertainty (one s.d., see Methods).
Extended Data Figure 4 | Chemical shift fingerprinting dG•dT excited
states. a, RD-derived dG15-N1 and dT5-N3 chemical shifts (CSs) (referenced
to GS WB) for ES1 (25 °C and pH 6.9) and ES2 (25 °C and pH 8.4) of hp-GT
and ES1 of Dickerson-GT (25 °C and pH 6.9) are shown. Errors in all RD-derived
fitted parameters (for example, Δω) reflect experimental uncertainty
(one s.d.) from the weighted global fit (see Methods). b, RD-derived hp-GT
dG•dT ES1 (blue) and ES2 (green) ¹⁵N CSs are shown as a function
of temperature and pH for both dG15-N1 (square) and dT5-N3 (circle).
c, Scheme used to calculate CSs using DFT (see Methods). An idealized
B-form DNA helix is generated to give a central dG•dT mispair (red) that is
flanked by canonical dG•dC pairs, analogous to the hp-GT construct. Residues
are trimmed to 1-/9-methyl bases and i + 1/i − 1 pairs are frozen in place
for subsequent geometry optimizations and NMR CS calculations.
d, DFT-calculated CSs (referenced to an energy optimized WB geometry) are
shown for various tautomeric and anionic configurations, where dG^enol•dT/
dG•dT^enol represents population-weighted average over dG^enol•dT (80%) and
dG•dT^enol (20%). e, RD-derived ES1 and ES2 CSs are plotted against DFT-
calculated CSs of base opened dG•dT mispairs, taken from X-ray structures and
pruned to 1-/9-methyl bases. f, DFT-calculated CSs (referenced to an energy
optimized WB geometry) are plotted as a function of dG-N1 to dT-N3 inter-
atomic distance for a WC-like dG•dT tautomeric pair. g, Computational
studies predict that the tautomeric pathway proceeds via a planar
dG⁺•dT⁻ ion pair (charge delocalization is implied) that is highlighted by a
network of five H-bonds. h, Predicted pair geometry of an anionic dG•dT⁻
inverted wobble. Deprotonated dT-N3 is highlighted in red (charge
delocalization is implied). i, Predicted pair geometry of a dG°?°'*dT 
Hoogsteen mispair. 
Extended Data Figure 5 | Attempts to trap anionic dG. a, 1D ¹³C spectra
(without ¹³C-¹³C homonuclear decoupling) of the aromatic carbon region of
protonated dGTP (black) and anionic dGTP (red) showing CS perturbations
induced upon deprotonation of dGTP-N1. b, ¹³C spectra (without ¹³C-¹³C
homonuclear decoupling) of the aromatic carbon region of protonated dTTP
(black) and anionic dTTP (red) showing CS perturbations induced upon
deprotonation of dTTP-N3. c, 2D [¹⁵N, ¹³C] HMQC spectra of dGTP showing
CS perturbation of dGTP-N1 induced upon deprotonation. The spectra are rotated
by 90° to depict the ¹⁵N CS along the x-axis for visualization purposes. Red
circles on inset structure highlight measured resonances (C6 and N1). d, 2D
[¹⁵N, ¹³C] HMQC spectra of dTTP showing CS perturbation of dTTP-N3 induced upon
deprotonation. The spectra are rotated by 90° to depict the ¹⁵N CS along the
x-axis for visualization purposes. Red circles on inset structure highlight measured
resonances (C4 and N3). e, hp-GT DNA spectra of the dG/dT aromatic carbons
upon increase in pH from 6.9 (black) to 10.7 (red). Minor upfield CSs are
observed for dT5-C6 and dG9-C8, but not dG15-C8, indicating the dT5 in the
dG•dT mispair is likely undergoing deprotonation and not the paired dG15.
f, 8BrG15-hp-GT DNA construct bearing a ¹³C/¹⁵N site-labelled dT5 paired
with an 8-bromo-2′-deoxyguanosine is shown (left) along with the ¹⁵N RD
profile for the paired dT5-N3. Error bars represent experimental uncertainty
(one s.d., see Methods).
hp-GU-20 RNA   6.9  rG16•rU5  ES1  3.86±0.02  5.48±1.03  1.62±1.03  15.7±0.01  13.8±0.46  −1.87±0.46  11.8±0.01  8.36±0.92  −3.48±0.92
*hp-GU-20 RNA  8.4  rG16•rU5  ES2  3.29±0.11  –          –          15.9±0.08  –          –           12.6±0.08  –          –
Extended Data Figure 6 | Kinetic-thermodynamic plots and parameters.
a, Kinetic-thermodynamic diagram for exchange between GS and ES1 via a
transition state for hp-GT DNA ES1 (left) and hp-GU-20 RNA ES1 (right),
showing activation (G‡) and net free energy (G), enthalpy (H), and entropy (TS)
changes (referenced to 0). b, Kinetic-thermodynamic parameters derived
from RD data. Asterisk denotes parameters calculated using only a single
temperature (see Methods), wherein enthalpic and entropic parameters cannot
be derived. Here, dG15•dT5 ES2 values were calculated at 25.05 °C, rG16•rU5
ES2 values were calculated at 20.05 °C, and dG15•5BrdU5 ES1 and ES2
values were calculated at 10.05 °C. Error reflects experimental uncertainty (one
s.d.) of the weighted global fits of the corresponding RD profiles. Error is
propagated using the respective uncertainties in k_ex and p_ES.
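
As an illustration of how single-temperature free energies and their propagated errors can be obtained from an exchange rate and an excited-state population, the following sketch assumes a two-state GS/ES equilibrium and transition-state (Eyring) theory with a transmission coefficient of 1. These assumptions, the function names and the units (kcal mol⁻¹) are illustrative; the Methods should be consulted for the treatment actually used.

```python
import math

R_KCAL = 1.987204e-3         # gas constant, kcal mol^-1 K^-1
K_B = 1.380649e-23           # Boltzmann constant, J K^-1
H_PLANCK = 6.62607015e-34    # Planck constant, J s


def free_energy(p_es, temp_k):
    """Free energy of ES relative to GS for two-state exchange (p_GS = 1 - p_ES)."""
    return -R_KCAL * temp_k * math.log(p_es / (1.0 - p_es))


def free_energy_sd(p_es, p_es_sd, temp_k):
    """First-order propagated uncertainty in the relative free energy."""
    return R_KCAL * temp_k * p_es_sd / (p_es * (1.0 - p_es))


def forward_barrier(k_ex, p_es, temp_k, kappa=1.0):
    """Activation free energy for GS -> ES, using forward rate k1 = p_ES * k_ex.

    kappa is the transmission coefficient (assumed to be 1 here).
    """
    k1 = p_es * k_ex
    return -R_KCAL * temp_k * math.log(k1 * H_PLANCK / (kappa * K_B * temp_k))


def forward_barrier_sd(k_ex, k_ex_sd, p_es, p_es_sd, temp_k):
    """Propagated uncertainty in the barrier, assuming uncorrelated k_ex and p_ES errors."""
    rel = math.sqrt((k_ex_sd / k_ex) ** 2 + (p_es_sd / p_es) ** 2)
    return R_KCAL * temp_k * rel
```

With data at a single temperature only the free energies can be computed in this way; separating enthalpic and entropic contributions requires the temperature dependence.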
Extended Data Figure 7 | Trapping or stabilizing dG•dT ES1 and ES2.
a, m6G15-hp-GT DNA construct is shown (left) where dG15 is methylated at
the O6 position to trap a near-WC ‘dG^enol•dT’-like geometry (Fig. 3c). CS
perturbations induced in the aromatic (centre) and sugar (right) resonances
upon O6-methylation (blue) are shown with the hp-GT DNA spectra (black) and the
resonances for the dG•dT mispair from hp-GT DNA in red. m6dG•dT mispair
and CSs are highlighted in red. b, Similarly, m6G4-Dickerson-GT DNA
construct is shown (left) where dG4 is O6-methylated to trap a WC-like state,
with similar colour scheme as a. c, 5BrU5-hp-GT DNA construct bearing a
¹³C/¹⁵N site-labelled dG15 paired with a 5-bromo-2′-deoxythymidine is shown
(left) along with the ¹⁵N RD profile for the paired dG15-N1. Error bars
represent experimental uncertainty (one s.d., see Methods).
Extended Data Figure 8 | Rotating frame relaxation dispersion profiles for
rG•rU mispairs in hp-GU-20, hp-GU-24 and xpt-G RNA constructs.
a, b, RNA constructs and the imino [¹⁵N, ¹H] HSQC zoomed into the rG•rU
wobble region of the spectra for hp-GU-20 and hp-GU-24. rG•rU mispair
resonances are shown in red. c, The Bacillus subtilis guanine binding riboswitch
(xpt-G RNA) construct and full imino [¹⁵N, ¹H] HSQC of folded and guanine
ligand-bound riboswitch. rG•rU mispair resonances are shown in red. d–f, ¹⁵N
RD profiles for hp-GU-20 (d), hp-GU-24 (e) and xpt-G (f) riboswitch
RNA. Error bars represent experimental uncertainty (one s.d., see Methods).
Extended Data Figure 9 | CS fingerprinting rG•rU excited states. a, RD-
derived rG16-N1 and rU5-N3 CSs (referenced to GS WB) are shown for ES1
(20 °C and pH 6.9) and ES2 of hp-GU-20 (20 °C and pH 7.9) and ES1 rG18-N1
and rU7-N3 CSs of hp-GU-24 (25 °C and pH 6.9). Errors in all RD-derived
fitted parameters (for example, Δω) reflect experimental uncertainty (one s.d.)
from the weighted global fit (see Methods). b, RD-derived CSs (referenced to
GS WB) are shown for the ES of xpt-G riboswitch (rU17-N3 and rU69-N3) at
25 °C and pH 7.9. c, 2D [¹⁵N, ¹³C] HMQC spectra of rUTP showing CS perturbation of
rUTP-N3 induced upon deprotonation. The spectra are rotated by 90° to depict
the ¹⁵N CS along the x-axis for visualization purposes. Red circles on inset structure
highlight measured resonances (C4 and N3). d, 2D [¹⁵N, ¹³C] HMQC spectra of
rGTP showing CS perturbation of rGTP-N1 induced upon deprotonation. The spectra are
rotated by 90° to depict the ¹⁵N CS along the x-axis for visualization purposes. Red
circles on inset structure highlight measured resonances (C6 and N1). e, RD-
derived hp-GU-20 rG•rU ES1 (blue) and ES2 (green) CSs are shown as a
function of temperature and pH for both rG16-N1 (square) and rU5-N3
(circle). f, Scheme used to calculate CSs using DFT. An idealized A-form RNA helix
is generated to give a central rG•rU mispair (red) that is flanked by canonical
rG•rC and rA•rU pairs, analogous to the hp-GU-24 construct. Residues are
trimmed to 1-/9-methyl bases and i + 1/i − 1 pairs are frozen in place for
subsequent geometry optimizations and CS calculations (see Methods).
g, DFT-predicted CSs (referenced to an energy optimized WB geometry) are
shown for various tautomeric and anionic configurations, where rG^enol•rU/
rG•rU^enol represents population weighted average CSs of rG^enol•rU (60%) and
rG•rU^enol (40%). h, ¹⁵N rG-N1 and rU-N3 CS comparison between RD-derived
ES1 CSs and population weighted DFT-predicted CSs (60:40 vs. 80:20).
i, Computational studies predict that the tautomeric pathway for a rG•rU pair
can proceed via a planar rG⁺•rU⁻ ion pair (charge delocalization is implied)
that is highlighted by a network of five H-bonds. j, Pair geometry of an
anionic rG•rU⁻ inverted wobble. Deprotonated rU-N3 is highlighted in red
(charge delocalization is implied).
Extended Data Figure 10 | dG•dT misincorporation probabilities and
correlation to WC-like excited states. a, Explicit dGTP•dT and dG•dTTP
kinetic misincorporation and base substitution probabilities (n = 53) and
associated errors (see Supplementary Discussion 8) are plotted against
hp-GT dG•dT ES1 (blue squares) and ES2 (green triangles). The pK_a fit of ES2
probabilities to the Henderson-Hasselbalch equation (equation (4), see
Methods) is shown as the green trend line. b, Red trend line shows the pK_a fit to




dGTP•dT misincorporation probabilities from pH 6.5–8.6 to the Henderson-
Hasselbalch equation. The fit was weighted using reported experimental
errors and gave a reduced χ² of 3.56. c, d, Extrapolated dG•dT ES2 probability
(s.d. from the weighted global fit) is plotted against dGTP•dT (left) and
dG•dTTP (right) misincorporation probabilities (errors as given) from
pH 6.5–9.5.
Crystal structure of the eukaryotic origin recognition complex


Franziska Bleichert¹, Michael R. Botchan² & James M. Berger¹


Initiation of cellular DNA replication is tightly controlled to sustain genomic integrity. In eukaryotes, the heterohexameric
origin recognition complex (ORC) is essential for coordinating replication onset. Here we describe the crystal structure
of Drosophila ORC at 3.5 Å resolution, showing that the 270 kilodalton initiator core complex comprises a two-layered
notched ring in which a collar of winged-helix domains from the Orc1–5 subunits sits atop a layer of AAA+ (ATPases
associated with a variety of cellular activities) folds. Although canonical inter-AAA+ domain interactions exist between
four of the six ORC subunits, unanticipated features are also evident. These include highly interdigitated domain-swapping
interactions between the winged-helix folds and AAA+ modules of neighbouring protomers, and a quasi-spiral arrangement
of DNA binding elements that circumnavigate an approximately 20 Å wide channel in the centre of the complex.
Comparative analyses indicate that ORC encircles DNA, using its winged-helix domain face to engage the mini-chromosome
maintenance 2–7 (MCM2–7) complex during replicative helicase loading; however, an observed out-of-plane rotation of
more than 90° for the Orc1 AAA+ domain disrupts interactions with catalytic amino acids in Orc4, narrowing and sealing
off entry into the central channel. Prima facie, our data indicate that Drosophila ORC can switch between active and
autoinhibited conformations, suggesting a novel means for cell cycle and/or developmental control of ORC functions.


The faithful replication of chromosomes relies on evolutionarily conserved
initiator proteins to recruit ring-shaped helicases to DNA in a
cell-cycle-regulated manner (reviewed in refs 1–3). Replication initiators
belong to the AAA+ protein superfamily, a large group of multisubunit
nucleotide hydrolases that function as motors or molecular
switches in many cellular processes. AAA+ NTPases assemble into
homo- or hetero-oligomeric complexes that actively alter the conformation
or position of client macromolecules in response to ATP binding
and hydrolysis.

In eukaryotes, replication initiation is promoted by the six-subunit
ORC assembly. Five of the six subunits of ORC (Orc1–5) retain AAA+
modules, while the sixth (Orc6) is composed of tandem cyclin-box
folds similar to transcription factor IIB (TFIIB)¹⁰,¹¹. During initiation,
ORC binds replication origins, recruiting another AAA+ ATPase, Cdc6,
to DNA in a nucleotide-dependent manner (reviewed in refs 3, 12). The
DNA-bound ORC•Cdc6 complex in turn recruits the MCM2–7 replicative
helicase and its associated Cdt1 chaperone to origins, promoting
the loading of MCM2–7 complexes onto DNA (reviewed in refs 3, 13).

So far, many of the molecular mechanisms by which ORC assembles
and operates have remained enigmatic. To better understand ORC function,
we therefore determined the crystal structure of the Drosophila
complex to 3.5 Å resolution. The structure highlights a domain-swapped
organization for ORC and captures the complex in an unanticipated,
autoinhibited conformation. Analysis of the structure leads to a revised
model for DNA binding and proposed ORC•MCM2–7 contacts, and
adds to a stepwise series of assembly and conformational intermediates
that help account for how the complex acts during the early stages of
replication initiation.


Crystal structure of the ORC hexamer 

Sequence analyses had indicated that the Orc1–5 subunits would share
a domain architecture similar to that of archaeal Orc proteins, with an
AAA+-type ATPase fold fused to at least one carboxy (C)-terminal
winged-helix (WH) DNA-binding domain (Fig. 1a). For its part, the
Orc6 C terminus has been reported to bind to ORC1–5 through a
domain insertion in Orc3, leaving its TFIIB-like domain conformationally
independent of the ORC core (Fig. 1a). For crystallizing Drosophila
ORC, we designed a ‘trimmed’ construct lacking the flexible amino (N)-terminal
extensions of Orc1, Orc2 and Orc3 (ref. 14), and the Orc6
TFIIB region (Fig. 1a). Neither modification interfered with ORC assembly,
nor did they affect the overall architecture of ORC (Extended Data
Fig. 1a–c). This ORC core (referred to as ORC hereafter) crystallized in
space group I222 with one Orc1–6 heterohexamer per asymmetric unit.
The structure was phased by single-wavelength anomalous dispersion
and refined to 3.5 Å with R_work/R_free values of 0.22/0.26 (Extended Data
Fig. 2a–c and Extended Data Table 1).
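
For readers unfamiliar with these statistics, the conventional crystallographic R-factor, R = Σ| |F_obs| − |F_calc| | / Σ|F_obs|, can be sketched as below. R_work is computed over reflections used in refinement and R_free over a held-out test set. The function names are hypothetical, the calculated amplitudes are assumed to be already scaled to the observed ones, and real refinement programs apply further corrections not shown here.

```python
def r_factor(f_obs, f_calc):
    """Conventional R-factor over paired structure-factor amplitudes."""
    numerator = sum(abs(abs(o) - abs(c)) for o, c in zip(f_obs, f_calc))
    denominator = sum(abs(o) for o in f_obs)
    return numerator / denominator


def r_work_free(f_obs, f_calc, free_flags):
    """Return (R_work, R_free); free_flags marks reflections held out of refinement."""
    work_obs = [o for o, flag in zip(f_obs, free_flags) if not flag]
    work_calc = [c for c, flag in zip(f_calc, free_flags) if not flag]
    free_obs = [o for o, flag in zip(f_obs, free_flags) if flag]
    free_calc = [c for c, flag in zip(f_calc, free_flags) if flag]
    return r_factor(work_obs, work_calc), r_factor(free_obs, free_calc)
```

A gap of a few percentage points between the two values, as reported here, is typical because the test-set reflections do not influence the refined model.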


AAA+ and WH domains interlock within the ORC body 


The ORC structure shows that the complex forms a lopsided, two-tiered
ring with a cashew-shaped protuberance off one edge (Fig. 1b, c and
Supplementary Video 1). Orc1–5 comprise the ring body, which bears
a prominent central channel, while a large domain insertion in Orc3
forms a bi-lobed, α-helical extension that engages a short α-helix formed
at the Orc6 C terminus. In contrast to models based on prior electron
microscopy (EM) reconstructions, the AAA+ subunits are arranged
in the order of Orc1–Orc4–Orc5–Orc3–Orc2, thus revising the placement
for Orc2 and Orc3 within the pentameric ORC ring (Fig. 1b).
In the structure, Orc1–5 each comprises one AAA+ fold, followed
by a single C-terminal WH domain (Extended Data Fig. 3). The AAA+
and WH regions co-associate, but are segregated between the two ring
tiers (Fig. 1b, c and Supplementary Video 1). Interestingly, the collar of
WH domains is rotationally offset from the AAA+ domains, leading
to a domain-swapped organization wherein, apart from Orc2, the WH
domain of one subunit packs against the AAA+ domain of its adjoining
partner (Fig. 1c). Domain swapping is facilitated by long linkages between
the AAA+ and WH modules, a region known to be conformationally
Figure 1 | Structure of Drosophila ORC. a, Domain organization of ORC
subunits. Dashed lines demarcate the ORC core used for crystallization
(TFIIB and CTD, transcription factor-II-like and C-terminal domains in Orc6;
BAH, bromo-adjacent homology domain in Orc1). b, Crystal structure of ORC.
Domains of individual subunits are coloured as in a. c, Side view of ORC
(surface) highlighting the two-tiered, domain-swapped organization of the
ORC body (Orc1 AAA+ and Orc2 WH domains, as well as N-terminal Orc2
residues (built as polyalanine) are not shown). Exploded view (cartoon)
showing the packing of WH domains against adjacent subunits’ AAA+ regions.


flexible in archaeal Orc homologues (Extended Data Fig. 3). Other
deviations from archetypal AAA+ structures include the absence of a
small α-helical ‘lid’ subdomain in the AAA+ fold of Orc2, a larger-
than-normal lid in Orc4 and a complete domain insertion in the lid of
Orc3; the last augmentation forms the protuberance that binds Orc6
(Extended Data Figs 3 and 4 and Supplementary Discussion). Globally,
the interdigitated interactions between the WH and AAA+ collars, as
well as the extensive protein-protein contacts between adjoining WH
domains, indicate that the WH domains are an important determinant
of ORC stability (5,660 Å² of total surface area are buried between all
WH domains, and 15,052 Å² between the WH and AAA+ tiers).


DNA binding elements in the WH domain collar 


In archaeal Orc homologues, the WH element recognizes origin sequences
by using a helix–turn–helix motif and a β-hairpin ‘wing’ to interact with
the adjacent major and minor grooves of double-stranded DNA
(Fig. 2a). Given the conservation between archaeal Orcs and eukaryotic
Orc1–5 proteins, we anticipated that the WH domains of ORC would
bind DNA in a similar manner. However, the second α-helix of the
helix–turn–helix in the WH domain (corresponding to the DNA ‘recognition
helix’) is buried against the AAA+ tier in all subunits but Orc2
(Fig. 2b). This arrangement leaves the β-hairpin wings of Orc1, Orc4,
Orc5 and Orc3 solvent exposed, which in turn co-localize to form a
portion of the interior surface within the central channel in the ORC
body (Fig. 2b, c). Given the extensive contacts between the WH and
AAA+ tiers, sequestration of the recognition helices seems necessary
to maintain ORC integrity. Thus, certain aspects of DNA recognition
by the WH domains of ORC probably differ from the approach used
by archaeal Orcs.


Quasi-canonical AAA+ domain packing within ORC 


AAA+ ATPases tend to oligomerize into closed-ring, open-ring or helical
assemblies. On the basis of phylogenetic, biochemical and EM
data, ORC has been proposed to follow a similar trend. The ORC
structure shows that, of the various AAA+ domain interactions, the one
between Orc4 and Orc5 is most like that seen in typical AAA+ systems,
whereby the two subunits associate to form a bipartite nucleotide-
binding site at the inter-protomer interface (Fig. 3a and Extended Data
Fig. 5a). Metazoan Orc4 manifests conserved Walker A and B ATPase
motifs (GKT and D(D/E), respectively), which in related AAA+ ATPases
typically contact both β- and γ-phosphates of bound nucleotide and a
catalytically important Mg²⁺ ion. Interestingly, Orc5 donates a well-
conserved ‘arginine finger’ (Arg144) into the Orc4 active site (Extended
Data Fig. 5a, b). Moreover, although Orc4 does not possess a typical
‘Sensor II’ arginine (an amino-acid motif in AAA+ ATPases that often
aids hydrolysis), its Walker A region does retain a conserved basic amino
acid (Arg58) (Extended Data Fig. 5c) that occupies an analogous position
(Extended Data Fig. 5a). Overall, the observed structural organization
and conservation of catalytically important residues in the Orc4•Orc5
interface raise the possibility that, in some organisms, Orc4 might be
able to support some level of ATP turnover, conformationally modulate
ORC function in response to ATP, or require nucleotide to promote
ORC stability (as has been observed for the human complex).
Any such activity probably varies between species, however, as Saccharomyces
cerevisiae Orc4 neither contains a conserved Walker A motif
nor has been found to bind/hydrolyse ATP.

Other AAA+/AAA+ interactions within ORC deviate to a greater
or lesser extent from the canonical packing arrangements exemplified
by Orc4•Orc5. For example, the Orc5•Orc3 interface is relatively open,
with fewer contacts between AAA+ modules (Fig. 3a and Extended
Data Fig. 5d). Although Orc5 possesses a canonical Walker A motif in
most eukaryotes, the same is not true for its Walker B motif, nor is there
a candidate Orc5 Sensor II residue or an Orc3 arginine finger. These
structural features are consistent with biochemical observations showing
that Orc5 binds ATP but lacks ATP hydrolysis activity. By
comparison, residues for nucleotide binding and hydrolysis are not
conserved at the Orc3•Orc2 AAA+/AAA+ interface; hence, the Orc2


Figure 2 | Eukaryotic ORC and archaeal Orc WH
domains. a, WH domain–DNA interactions in
archaeal Orc1-1 (PDB accession number 2QBY
chain A)¹⁶. The WH domain (grey) uses a helix–
turn–helix (black and white) motif and a β-hairpin
wing (cyan) motif to engage DNA (tan surface).
b, Disposition of WH domains in ORC. The ORC
WH domains (excepting Orc2) form a collar in
which the recognition helices (black) are buried
and the β-hairpin wings form an exposed portion
of the central channel (the Orc1 AAA+ domain is
not shown). The helix–turn–helix motifs and
β-hairpin wings of ORC subunits are coloured as
in a. c, Schematic of the helix–turn–helix and
β-hairpin motifs in ORC.
and Orc3 AAA+ domains and their interactions appear to play a predominantly
structural part in ORC assembly.

Consistent with its blend of typical and atypical local AAA+/AAA+
interactions, the global organization between the ORC AAA+ domains
is also mixed. Of AAA+ proteins, ORC is most closely related phylogenetically
to other replication initiators, including bacterial DnaA, and
to DNA polymerase clamp loaders. In comparing the ORC ATPase
assembly with these systems, it can be seen that the Orc4•Orc5 and
Orc3•Orc2 AAA+ folds are organized similarly to protomers in ATP-
assembled DnaA (Fig. 3a). By contrast, the AAA+/AAA+ organization
at the Orc5•Orc3 interface is relatively open. Consequently, the
clean helical symmetry observed in ATP-assembled DnaA oligomers
is broken within the Orc5•Orc3 junction of ORC, a configuration that
creates a cracked ring-like architecture reminiscent of clamp loaders
(Fig. 3b–d). Despite the somewhat more planar arrangement of ATPase
folds in ORC, the Orc2–5 initiator specific motifs (or ‘ISMs’, an α-helical
modification that both distinguishes AAA+-family replication initiators
and binds DNA directly) nonetheless cluster together, forming
a shallow, quasi-spiral shaped set of ‘threads’ that line the interior
of the ~20 Å wide central ORC channel (Fig. 3b). Thus, the structural
features of ORC are a hybrid of both clamp loader and prokaryotic
initiator systems.


An unanticipated Orcl conformation 


In addition to Orc4 and Orc5, one other ORC subunit also known to
bind ATP is Orc1 (refs 27, 30). Indeed, Orc1 serves as the major source
of ATPase activity in ORC and requires a conserved arginine residue
from Orc4 for catalytic activity. Given this activity, we expected, as
with Orc2–Orc5, to see relatively canonical AAA+ interactions between
Orc1 and Orc4 in the structure. Surprisingly, the AAA+ domain of
Orc1 is completely disengaged from Orc4, owing to a buckling of the
Figure 3 | AAA+/AAA+ domain interactions in
ORC. a, Pairwise AAA+ interactions in the
Orc2–Orc5 oligomer compared with canonical
AAA+ interactions in an ATP-bound DnaA dimer
(PDB accession number 2HCB). Superpositions
were performed using the AAA+ domain of the
left-most protomer of a given ORC subunit pair
and DnaA homodimer. b–d, Comparison of
AAA+ domain assemblies (to scale) for Orc2–5
(b), ATP-assembled DnaA (PDB accession
number 2HCB) (c) and replication factor C (RFC,
PDB accession number 1SXJ) (d). Upper panels
depict side views, with the purple AAA+ domains
of DnaA and RFC co-oriented as per the Orc4
AAA+ region. Lower panels show views down
the central channel of ORC and RFC, or slightly
offset from the helical axis of DnaA. The initiator
specific motifs (ISMs, red) of Orc2–5 and DnaA are
highlighted (RFC lacks this element). Only the
AAA+ domains of ORC, DnaA and RFC are
shown; the RFC-A subunit is omitted. Colouring
for ORC is maintained as in a, with sequential
subunits in DnaA and RFC coloured accordingly.


linker at a single region between the Orc1 ATPase and WH folds (residues
819–821) that gives rise to a ~105° out-of-plane rotation from
the Orc2–5 AAA+ domains (Fig. 4a). Although this Orc1 AAA+/WH
domain juxtaposition is unique compared with other subunits in the
structure (Extended Data Fig. 3), it is similar to one of the conformations
adopted by an ADP-bound archaeal Orc homologue, Aeropyrum
pernix Orc2 (ref. 18) (Extended Data Fig. 6a, b). Notably, the movement
of Orc1 does not considerably affect the relative arrangement of its two
AAA+ subdomains, which is maintained as in Orc3–5 and archaeal
Orcs (Extended Data Fig. 6c–e). The en bloc reorientation of Orc1 appears
stabilized by contacts between the Orc1 AAA+ domain and the WH
domains of Orc1, Orc2 and Orc3, together burying a total of 4,256 Å²
at the interface (Fig. 4b and Extended Data Fig. 6f–h).
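
One common way to quantify a rigid-body reorientation of this kind is to superpose the domain onto a reference position and read the rotation angle from the resulting 3 × 3 rotation matrix R using θ = arccos((tr R − 1)/2). The sketch below illustrates only that last step; it is not a description of how the angle reported here was measured, and the function names and the test rotation are hypothetical.

```python
import math


def rotation_angle_deg(rot):
    """Rotation angle (degrees) encoded by a 3x3 rotation matrix given as nested lists."""
    trace = rot[0][0] + rot[1][1] + rot[2][2]
    # Clamp to [-1, 1] to guard against rounding error before acos.
    cos_theta = max(-1.0, min(1.0, (trace - 1.0) / 2.0))
    return math.degrees(math.acos(cos_theta))


def rotation_about_z(angle_deg):
    """Example rotation matrix about the z axis, used only to exercise the function."""
    a = math.radians(angle_deg)
    return [
        [math.cos(a), -math.sin(a), 0.0],
        [math.sin(a), math.cos(a), 0.0],
        [0.0, 0.0, 1.0],
    ]
```

In practice the rotation matrix would come from a least-squares superposition of equivalent Cα atoms, which is outside the scope of this sketch.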

A consequence of the disposition of Orc1 within the complex is that
its nucleotide binding cleft resides ~40 Å away from the arginine finger
of Orc4. Importantly, comparison of the crystal structure with a prior
three-dimensional EM reconstruction of ATPγS-bound Drosophila
ORC¹⁴ shows excellent agreement between the two models (Fig. 4c
and Supplementary Video 2), indicating that the Orc1 conformation
in the crystal corresponds to the predominant state of the complex in
solution. Moreover, co-crystallization of ORC with the ATP analogue
ATPγS, while showing clear density for nucleotide binding to the Orc1,
Orc4 and Orc5 AAA+ folds (Extended Data Fig. 7a–c), recapitulates
the configuration seen in apo-ORC. Together, these data indicate that
Drosophila Orc1 must undergo a large structural change to support
ATPase activity, but that ATP binding is itself insufficient to drive such
a rearrangement in most ORC particles.


Implications of the structure for DNA binding by ORC 


In the ORC structure, the central channel within the body of the complex
is formed by both the Orc2–5 ISMs and the β-hairpin wings of
Orc1 and Orc3–5. In archaeal Orcs, these two elements both bind to
duplex DNA. To investigate if the ORC central channel could accommodate
any of the DNA interactions typified by archaeal Orcs, we superposed
the DNA-bound crystal structure of Sulfolobus solfataricus Orc1-1
(ref. 16) onto Orc4 (after Orc1, Orc4 is most closely related to archaeal
Orcs). Notably, superpositioning of the AAA+ domains of the two
proteins (Fig. 5a) resulted in a placement for DNA that aligns the duplex
coaxially with the central ORC channel (Fig. 5b). Inspection of the
resultant ORC•DNA model not only reveals that the quasi-spiral formed
by the Orc2–5 ISMs approximates that of the docked duplex, but that
the β-hairpin wings of the Orc1 and Orc3–5 WH domains also reside
in a position where they can access the nucleic-acid segment (Fig. 5b).
The superpositioning between DNA-bound archaeal Orc1-1 and
the DNA-free Drosophila ORC imaged here has important implications
for understanding how the eukaryotic initiator engages origin regions.
One is that ORC probably binds DNA using a mechanism similar to
that of sliding clamp loaders, which encircle primer-template junctions.
This binding mode is congruent with a recent proposal based on EM
analysis of an ORC•Cdc6•Cdt1•MCM2–7 complex, which posits that
ORC helps align the ring of an MCM2–7 complex around DNA. The
apparent sequence-specificity of DNA binding by S. cerevisiae ORC
(in contrast to metazoan ORC) probably results from specific interactions
between amino-acid side chains in the ORC channel and nucleotide
bases in specific autonomously replicating sequence elements.



Figure 4 | An unanticipated but naturally occurring Orc1
conformation. a, The Orc1 AAA+ domain is disengaged from Orc4 and
sits above the plane of the Orc2–Orc5 AAA+ oligomer. Only the AAA+
domains of Orc1–5 and the WH domain of Orc1 are depicted; the
hinge point between the Orc1 AAA+ and WH regions is indicated. b, The
Orc1 conformation is stabilized by contacts (asterisks) between its
AAA+ domain and the WH domains of Orc1–3 (the C-terminal
helix of Orc1 is in dark blue). See also Extended Data Fig. 6f–h.
c, Docking study showing that the ORC crystal structure matches the
three-dimensional EM volume of Drosophila ORC (EMD-2479
(ref. 14)). See also Extended Data Fig. 8c and Supplementary Video 2.


At present it is unclear from the structure why Drosophila ORC
prefers to bind negatively supercoiled DNA over linear segments,
a preference also reported for Schizosaccharomyces pombe ORC.
Archaeal Orcs are known to underwind DNA upon binding, suggesting
that the eukaryotic ORC AAA+ and WH domains may cooperate
to do likewise; topological changes in DNA structure induced by
S. pombe and human ORC are consistent with such an interpretation.
Alternatively, if ORC were to bind B-form DNA, conformational transitions
within the ORC body and its associated DNA-binding elements
would be required to accommodate the DNA duplex. In this instance,
specificity for negatively supercoiled substrates could arise from the
relative positioning of the TFIIB-like DNA-binding elements in Orc6
(refs 10, 11, 40) and the DNA-recognition elements in the central ORC
channel.


ORC activation and implications for MCM2-7 loading 


Modelling indicates that the DNA-binding elements in the central ORC
channel can encircle a DNA duplex; however, certain features of the
complex would appear to preclude Drosophila ORC from doing so in
the state seen both here and by EM. For example, passage of the DNA
through the entirety of the central channel is prevented by a constriction
formed by both the Orc2 WH domain and the Orc1 AAA+ fold
(Fig. 1b). Similarly, although the ATPase region of the ORC ring is
cracked open, the Orc2 WH domain and the Orc1 AAA+ fold occlude
this crack, thereby preventing the lateral entry of DNA into the central
channel from the side of the complex (Figs 1b and 5c). This observation
probably helps to explain the weak effect that nucleotide has on DNA
binding by Drosophila ORC purified from fly embryos or recombinant
sources. It is interesting to note that, besides preventing DNA binding,
the placement of the Orc1 ATPase region and the Orc2 WH fold also
blocks the known interaction site for another critical component of
replication initiation, Cdc6 (refs 23, 24). Overall, the simplest
interpretation of the ORC conformation imaged here is that it corresponds to a
naturally autoinhibited form of the complex, and that in some organisms
only a fraction of the total ORC pool that can be obtained from
asynchronously dividing cells may be capable of productively altering
its interactions with DNA in response to ATP.

Figure 5 | The central channel in ORC probably binds DNA.
a, Structural alignment of Orc4 onto DNA-bound S. solfataricus Orc1-1 (PDB
accession number 2QBY chain A). b, Reciprocal superposition of DNA-bound
archaeal Orc1-1 onto Orc4 in the context of ORC aligns the DNA duplex with the
central channel of ORC. The ISMs of Orc2-5 and β-hairpin wings of Orc1 and
Orc3-5 (surface) form two semicircular, quasi-spiral collars around the docked
DNA (Orc1 AAA+ and Orc2 WH domains are omitted; the remainder of ORC is shown
as transparent cartoons). c, The DNA path (tan ribbon) through the central
channel is partly blocked by the Orc1 AAA+ (orange cartoon) and the Orc2 WH
(green cartoon) domains. The remainder of ORC is shown as surface
representation. Colouring as in b.

If Drosophila ORC first assembles into an inactive form, then what
manner of transition might push the complex into a new state in which
its ATPase region is now competent to bind DNA (or Cdc6)? Insights
into a simple structural rearrangement that could support such a switch
can be gleaned from what is known about archetypal AAA+ ATPase
organization. Using the Orc4·Orc5 interaction seen in the crystal structure
as a template, we generated a model for the expected arrangement
of a functional Orc1·Orc4 AAA+ ATPase centre by swivelling the
Orc1 AAA+ fold around a single hinge point in the linker region just
before its WH domain (Supplementary Video 1). The resultant model
not only restores expected AAA+ interactions between the Orc1 active
site and the Orc4 arginine finger (Fig. 6A and Extended Data Fig. 7d),
but also both removes the Orc1-mediated blockage of the putative path
for DNA in the central channel and co-aligns the Orc1 ISM with the
ISM helix formed by Orc2-Orc5 (Fig. 6B and Supplementary Video 1).
Docking of the rearranged model into the cryo-EM densities of
S. cerevisiae ORC shows a reasonable fit for Drosophila ORC containing
the repositioned Orc1 AAA+ domain, and further reveals that a
region of EM density, which extends from the centre of the ORC body,
actually corresponds to DNA (Extended Data Fig. 8a, b). In accord
with a two-state model, the EM density for S. cerevisiae ORC cannot
accommodate the AAA+ domain of Orc1 in the state imaged
crystallographically (Extended Data Fig. 8a), nor can the Drosophila
ATPγS-ORC EM volume account for a remodelled Orc1 conformation for
ORC (Extended Data Fig. 8c and Supplementary Video 2).
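
The rigid-body swivel used to build the rearranged model can be illustrated with a
minimal sketch, assuming the domain is treated as a set of points rotated about an
axis passing through a hinge point. The hinge coordinates, axis direction and atom
positions below are hypothetical placeholders, not values taken from the structure;
only the approximate 105° angle comes from the Fig. 6 legend.

```python
import math

def rotate_about_hinge(point, hinge, axis, angle_deg):
    """Rotate `point` by `angle_deg` about the line through `hinge` along `axis`.

    Uses Rodrigues' rotation formula; all arguments are 3-tuples of floats.
    """
    norm = math.sqrt(sum(a * a for a in axis))
    k = tuple(a / norm for a in axis)                 # unit rotation axis
    v = tuple(p - h for p, h in zip(point, hinge))    # vector relative to the hinge
    theta = math.radians(angle_deg)
    c, s = math.cos(theta), math.sin(theta)
    k_cross_v = (
        k[1] * v[2] - k[2] * v[1],
        k[2] * v[0] - k[0] * v[2],
        k[0] * v[1] - k[1] * v[0],
    )
    k_dot_v = sum(ki * vi for ki, vi in zip(k, v))
    rotated = tuple(
        v[i] * c + k_cross_v[i] * s + k[i] * k_dot_v * (1.0 - c) for i in range(3)
    )
    return tuple(r + h for r, h in zip(rotated, hinge))

if __name__ == "__main__":
    # Hypothetical hinge, axis and atom positions, for illustration only.
    hinge = (10.0, 0.0, 5.0)
    axis = (0.0, 0.0, 1.0)
    domain_atoms = [(12.0, 1.0, 5.0), (15.0, -2.0, 7.5)]
    moved = [rotate_about_hinge(a, hinge, axis, 105.0) for a in domain_atoms]
    for before, after in zip(domain_atoms, moved):
        print(before, "->", tuple(round(x, 3) for x in after))
```

Because the transformation is a pure rotation about the hinge, every atom keeps its
distance from the hinge point, which is what distinguishes an en bloc domain movement
from an internal conformational change.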

Collectively, the structure and analysis presented here help to define
a framework for understanding how Drosophila ORC interfaces with
its partner proteins and DNA during the initial stages of MCM2-7
loading (Fig. 6C). In this scheme, ORC would start off in an ATP-bound
but autoinhibited form (either dissociated from chromatin or in a
chromatin-bound state via secondary binding sites/partners) that is
restricted in its ability to either bind DNA within its central channel or
bind Cdc6 to its ring. Conversion of this state into an activated
configuration would involve the en bloc movement of the Orc1 ATPase
domain (Supplementary Video 1), allowing Orc1 to engage the arginine
finger of Orc4, and unlatching the Orc2 WH domain to open a
gap in the Orc1-5 ring. Once open, DNA would bind to the ISM and
β-hairpin elements in the central ORC channel, after which Cdc6 would
dock into the Orc1/Orc2 gap, trapping DNA within the centre of the
complex. One prediction of this model is that Cdc6 should bind to ORC
using its ATPase centre to engage an arginine finger in Orc1 (Drosophila
residue Arg734). After formation of a ternary ORC·DNA·Cdc6 complex,
the WH domains of Cdc6 and Orc2 would be expected to engage
the AAA+ folds of Orc1 and Cdc6, respectively, creating a circuit of
WH domains (and their associated β-hairpin elements) to help lock
the complex into place. Interestingly, these structural findings and
analyses indicate that, rather than using its AAA+ domains to bind
MCM2-7 (ref. 24), which requires an inverted order of ATPase site-arginine
finger interactions around the ORC ring (Extended Data
Fig. 9a, b), ORC probably uses its WH domain collar instead (Extended
Data Fig. 9c).

A major question still remaining is what event might trigger ORC
rearrangement, or why ORC should exist in an autoinhibited state. On
the basis of our EM and co-crystallization data, ATP binding alone is
incapable of efficiently driving such a transition for most of the particles
in a purified ORC preparation. Interestingly, phosphatase treatment
of Drosophila ORC stimulates DNA binding, suggesting that removal
of one or more post-translational marks might help convert ORC to an
active form. Moreover, metazoan ORC associates with chromatin in a
cell-cycle-dependent manner (reviewed in refs 1, 42), and the targeting
of ORC to chromosomes in metazoans (and in fission yeast) is known
to require protein-DNA contacts distinct from those in budding yeast,
such as the TFIIB-homology domain in metazoan Orc6 (refs 10, 40) or
the AT-hook in S. pombe Orc4 (ref. 43). The action of these elements,
together with the recognition of nucleosomes by the N-terminal BAH
domain of Orc1 (refs 44-46), suggests that the formation of
nucleotide-dependent contacts between the ATPase region of ORC and DNA may
take place after the formation of initial ORC·chromatin encounter
complexes. Moreover, metazoans need to stockpile ORC in oocytes, yet keep
this pool of ORC from prematurely initiating DNA replication before
fertilization; hence, ORC may be maternally stored in an inactive form
during oogenesis. In these contexts, autoinhibition could provide a novel
mechanism for regulating ORC's productive association with DNA in
a cell-cycle-dependent and/or developmental stage-specific manner.

Figure 6 | Model for ORC activation and its functional consequences.
A, An approximate 105° rotation of the Orc1 ATPase fold about a hinge point
within the AAA+/WH linker region juxtaposes the Orc1 active site with the
arginine finger surface of Orc4 (only Orc1, Orc4 and the Orc5 AAA+ domain are
depicted, both as a molecular model and as a schematic). B, Repositioning of
Orc1 extends the circuit of ISMs around the prospective DNA binding site of
the central channel. The Orc2 WH domain is omitted; DNA is modelled as per
Fig. 5 and shown as surface. C, Model for ORC activation and subsequent Cdc6
and MCM2-7 recruitment/loading. a, Prospective recruitment of autoinhibited
Drosophila ORC to DNA by elements such as the TFIIB domain of Orc6.
b, Activation of ORC promotes productive Orc1·Orc4 interactions and Orc2 WH
domain repositioning, exposing a gap in the ORC ring. c, DNA binds to the
central channel of ORC through ISM/β-hairpin wing interactions. d, Cdc6 binds
between Orc2 and Orc1, trapping DNA inside the complex. e, ORC·Cdc6 binds to
and loads one MCM2-7 hexamer (aided by Cdt1) using the WH domain face of
ORC·Cdc6. During loading, two MCM2-7 particles are assembled into a double
hexamer.


Online Content Methods, along with any additional Extended Data display items
and Source Data, are available in the online version of the paper; references unique
to these sections appear only in the online paper.

METHODS


No statistical methods were used to predetermine sample size. 

ORC construct design for crystallization. Initial crystallization trials with
full-length Drosophila ORC did not yield crystals. To obtain a crystallizable complex,
we therefore used multiple sequence alignments and EM to identify regions in ORC
subunits likely to be unconserved, disordered or flexibly tethered to ORC. On
the basis of these analyses, we designed a Drosophila ORC construct lacking the
N-terminal regions that precede the AAA+ domains of Orc1 (amino acids 1-532),
Orc2 (amino acids 1-265) and Orc3 (amino acids 1-46). These truncated subunits
were found to assemble into a stable hexameric complex and yielded two-dimensional
class averages very similar to full-length ORC when analysed by negative-stain EM,
indicating that removal of the termini does not affect overall ORC architecture
(Extended Data Fig. 1). In addition, we previously found that the TFIIB-like domain
of Orc6 is flexibly tethered to ORC via the Orc6 C-terminal domain; this region
was also removed for ORC crystallization. All biochemical experiments described
in the present paper were performed with this 'trimmed' ORC construct.
Cloning and baculovirus generation. ORC subunits Orc1 to Orc5 (Orc1: amino-acid
residues 533-924; Orc2: amino-acid residues 266-618; Orc3: amino-acid residues
47-721; Orc4: amino-acid residues 1-459; Orc5: amino-acid residues 1-460)
were cloned into a pFastBac-derived polycistronic BioBricks vector (University of
California Berkeley MacroLab). A hexa-histidine (6×His) tag was added to the
N terminus of Orc1 and a maltose binding protein (MBP) tag to the N terminus of
Orc4, both followed by a tobacco etch virus (TEV) protease cleavage site. The
C terminus of Orc6 (amino-acid residues 187-257) was cloned into a separate pFastBac
vector. For Orc6 binding and crosslinking experiments, Orc6 (amino-acid residues
187-257) was cloned with an N-terminal 6×His tag into a
ligation-independent-cloning (LIC)-compatible pFastBac vector. Point mutations
(Y225S, A236E and A236C) were introduced by site-directed mutagenesis and verified
by DNA sequencing.
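
As a bookkeeping aid, the construct boundaries stated above can be captured in a small
data structure and checked against the deleted N-terminal regions listed in the construct
design. This is only a sketch: the residue ranges are copied from the text, whereas the
dictionary and function names are illustrative assumptions.

```python
# Residue ranges (1-based, inclusive) as stated in the Methods text.
DELETED_N_TERMINI = {"Orc1": (1, 532), "Orc2": (1, 265), "Orc3": (1, 46)}
CONSTRUCTS = {
    "Orc1": (533, 924),
    "Orc2": (266, 618),
    "Orc3": (47, 721),
    "Orc4": (1, 459),
    "Orc5": (1, 460),
    "Orc6": (187, 257),  # C-terminal fragment cloned into a separate vector
}

def construct_length(subunit):
    """Number of residues retained in the construct for `subunit`."""
    start, end = CONSTRUCTS[subunit]
    return end - start + 1

def boundaries_consistent():
    """True if each construct starts right after its deleted N-terminal region."""
    return all(
        CONSTRUCTS[subunit][0] == deleted_end + 1
        for subunit, (_, deleted_end) in DELETED_N_TERMINI.items()
    )

if __name__ == "__main__":
    for name in CONSTRUCTS:
        print(name, CONSTRUCTS[name], construct_length(name), "residues")
    print("Truncations consistent with construct starts:", boundaries_consistent())
```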

Bacmids were generated in DH10Bac cells and isolated as per the Bac-to-Bac
protocol (Invitrogen Life Technologies). Sf9 cells were transfected with bacmid
DNA using Cellfectin II (Invitrogen Life Technologies), also according to the
manufacturer's instructions. Baculoviruses were amplified twice in Sf9 cells before
infecting large-scale cultures for protein expression.

Expression and purification of ORC for crystallization. For expression, 8 l of
High5 cells grown in spinner flasks were co-infected with two baculoviruses: the
multibac virus expressing Orc1-5 and the baculovirus expressing Orc6. Forty-eight
hours after infection, cells were harvested by centrifugation and ORC was
purified as described in ref. 14 with a few modifications. Briefly, harvested cells
were resuspended in 200 ml of lysis buffer (50 mM Tris-HCl (pH 7.8), 300 mM
KCl, 50 mM imidazole (pH 7.8), 10% glycerol, 200 µM PMSF, 1 µg ml⁻¹ leupeptin)
and lysed by sonication. The lysate was clarified by centrifugation, subjected to
20% (NH4)2SO4 precipitation on ice for 30 min, and re-centrifuged. ORC was
purified by passing the supernatant solution over a 5 ml HisTrap HP nickel-affinity
chromatography column (GE Healthcare), followed by amylose-affinity
chromatography using a 20 ml column (New England Biolabs). Purification tags were
removed from ORC by incubation with 6×His-tagged TEV protease overnight at 4 °C.
TEV protease and any uncleaved His-tagged Orc1-containing material were then
removed by an additional nickel-affinity chromatography step in 50 mM Tris-HCl
(pH 7.8), 300 mM KCl, 50 mM imidazole (pH 7.8) and 10% glycerol. The
flow-through was concentrated in 30K Amicon Ultra-15 concentrators (Millipore) and
purified by gel filtration chromatography using a HiPrep 16/60 Sephacryl S-300
HR column (GE Healthcare) equilibrated in 50 mM Tris-HCl (pH 7.8), 300 mM
KCl and 5% glycerol. Peak fractions were pooled and concentrated using 30K
Amicon Ultra-15 concentrators (Millipore). Protein was used immediately
thereafter for crystallography.

Crystallization. Before crystallization, ORC was dialysed overnight into
crystallization buffer (50 mM Tris-HCl (pH 7.8), 200 mM KCl, 5% glycerol and 0.5 mM
TCEP). Crystals grew by sitting-drop vapour diffusion as showers of small plates
within a few hours upon combining equal volumes of 10 mg ml⁻¹ ORC with
reservoir solution (50 mM PIPES (pH 7.5), 100 mM ammonium acetate, 10-35 mM
MgCl2 and 2.5-5% PEG 20,000) at 19-22 °C. These crystals were used as a source
for streak seeding to obtain larger crystals, which grew to their maximum size
within 2 days. For cryo-protection, the mother liquor was stepwise exchanged
(typically two steps for 3 h and 15 min, respectively) into 50 mM PIPES (pH 7.5),
25 mM KCl, 10 mM MgCl2, 50 mM ammonium acetate, 4% PEG 20,000, 10%
glycerol, 10% ethylene glycol, 20% xylitol and 0.25 mM TCEP. Crystals were then
harvested by looping and flash-frozen in liquid nitrogen. (NB: during
optimization, a subset of crystals was transferred to 4 °C and, after several days of
equilibration, cryo-protected and harvested at this temperature; this procedure seemed
to slightly increase the number of usable crystals.) For phasing, crystals were soaked
in cryo-protecting solution containing 1 mM GdCl3 for 3-4 h before harvesting.
Additional heavy-atom-derivatized crystals were obtained by incubating crystals
in cryo-protecting solution containing 100 µM ethyl mercuric phosphate for 3 h;
data from these soaks were used to identify metal-binding sites for confirming
amino-acid registers but were not used for phasing.

Crystallization and crystal harvesting for ATPγS-bound ORC were performed
as described for apo-ORC but with the following modifications: (1) ORC was
dialysed into crystallization buffer containing MgCl2 (50 mM Tris-HCl (pH 7.8),
200 mM KCl, 5% glycerol, 0.5 mM TCEP, 5 mM MgCl2); (2) before crystallization,
ATPγS was added to ORC to a final concentration of 1 mM; and (3) for harvesting,
0.5 mM ATPγS was added to the cryo-protectant to prevent dissociation of the
nucleotide.

Data collection and structure determination. The diffraction quality of
individual crystals varied greatly, necessitating the screening of many hundreds of
different crystals to identify acceptable targets for data collection. Crystal
screening was performed at beamlines 8.3.1 at the Advanced Light Source at Lawrence
Berkeley National Laboratory, X25 at the National Synchrotron Light Source at
Brookhaven National Laboratory and 23-ID-B at the Advanced Photon Source
at Argonne National Laboratory. Native diffraction data sets (λ = 1.0332 Å) as
well as single-wavelength anomalous dispersion (SAD) data sets for gadolinium
(λ = 1.71083 Å) and sulphur (λ = 1.7712 Å, see below) were collected at the Advanced
Photon Source 23-ID-B equipped with a MAR CCD detector. Although complete
data sets were typically obtained by exposing multiple regions within a single
crystal using the 'vector data collection' option to minimize radiation damage,
the best native data set was collected at a single site from a crystal harvested at 4 °C.
Data sets of ethyl mercuric phosphate-derivatized crystals were collected at the
National Synchrotron Light Source X25 (λ = 1.006 Å) on a Pilatus 6M detector.
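
For readers more used to photon energies than wavelengths, the values quoted above can
be converted with E = hc/λ. The sketch below assumes hc ≈ 12.39842 keV Å; the function
and dictionary names are illustrative and not part of the original protocol.

```python
HC_KEV_ANGSTROM = 12.39842  # Planck constant times speed of light, in keV * Angstrom

def wavelength_to_kev(wavelength_angstrom):
    """Convert an X-ray wavelength in Angstrom to a photon energy in keV."""
    return HC_KEV_ANGSTROM / wavelength_angstrom

# Wavelengths (in Angstrom) quoted in the data-collection paragraph.
DATA_SETS = {
    "native": 1.0332,
    "gadolinium SAD": 1.71083,
    "sulphur SAD": 1.7712,
    "ethyl mercuric phosphate": 1.006,
}

if __name__ == "__main__":
    for label, wavelength in DATA_SETS.items():
        print(f"{label}: {wavelength} A = {wavelength_to_kev(wavelength):.3f} keV")
```

Under this conversion the native and sulphur data sets correspond to roughly 12.0 keV
and 7.0 keV, respectively.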

Diffraction data were processed with XDS. Merging with the program AIMLESS
of the CCP4 software package (Extended Data Table 1) indicated that the crystals
belonged to space group I222, with unit cell dimensions of a = 145.5 Å, b = 259.0 Å
and c = 257.0 Å for the best native crystal. The Gd-SAD data set was obtained by
merging data from four different crystals. Despite slight non-isomorphism between
crystals, merging data from multiple crystals substantially improved the anomalous
signal, phases and electron-density interpretability compared with data sets
collected from single crystals.

For initial phasing by SAD, gadolinium sites were identified with SHELXD.
The strongest sites were then used as input into PHASER as implemented in
PHENIX to find additional sites and to obtain initial phases to ~4 Å. Maximum
likelihood density modification with RESOLVE was used to break the phase
ambiguity and to improve electron-density maps. At this point, experimental phases
were applied (again using PHENIX) to the native data set, which was of better
quality than the gadolinium derivative. The resulting electron-density maps (at
4 Å resolution) allowed identification of all five AAA+ domains and four of the
five WH domains, and revealed density for bulky side chains as expected for this
resolution. In parallel to data processing with SHELXD/PHASER/RESOLVE,
experimental phases and density maps were also calculated with SHARP and improved by
solvent flipping in SOLOMON. Although slightly less featured, the SOLOMON
electron-density maps were overall very similar to those obtained from RESOLVE
but additionally revealed clear protein density in some regions that were poorly
defined in RESOLVE density maps. Thus, while model building was performed
predominantly into RESOLVE density maps, SOLOMON density maps were used
as an additional guide to trace the main chain of the model. Phases were gradually
improved by iterative cycles of model building, density modification with phase
combination of experimental and model phases in RESOLVE, and phase extension
to 3.7 Å.

Model building was initiated by automated searches using MOLREP and by
manual docking using UCSF Chimera to place the AAA+ and WH domains
of archaeal Orc1/Cdc6 (PDB accession numbers: 2QBY; 1FNN) and/or homology
models for Drosophila ORC subunits (as generated by Phyre2 (ref. 62)) into
electron-density maps. These docked structures were valuable reference points
and facilitated tracing of most of the main chain; insights into likely subunit
positions (from low-resolution EM studies of Drosophila ORC), together with
knowledge of the domain architecture of ORC subunits (from sequence predictions),
allowed assignment of specific subunits to map density regions. Using COOT, a
nearly complete model of ORC was manually built de novo into phase-combined
and B-factor-sharpened RESOLVE density maps, guided by the topology of archaeal
Orc1/Cdc6 AAA+ and WH domains, as well as by secondary structure prediction
and multiple sequence alignments. The initial model was improved by iterative
rounds of refinement in PHENIX (real-space, individual xyz, individual atomic
displacement parameters), using secondary structure and (in early stages of
building) experimental phase restraints, as well as stereochemistry and atomic
displacement parameter weight optimization; subsequent rounds of model rebuilding
were performed using COOT. During the course of refinement, a slightly
higher-resolution and more complete native data set (to 3.5 Å) was obtained and used for
the final rounds of refinement, which also included refinement of TLS parameters.
The final model contains the AAA+ and WH domains of Orc1 to Orc5, the Orc3
domain insertion and the conserved C-terminal helix of Orc6; an N-terminal
region of Orc2 (preceding the AAA+ domain) was built as a poly-alanine model,
since the amino-acid register for this region could not be assigned unambiguously.
The final model was validated with MOLPROBITY and has excellent geometry
(MolProbity score 1.88), with no Ramachandran outliers and only a small fraction
(1.9%) of rotamer outliers (Extended Data Table 1).

During the course of model building, several approaches were used to validate
the sequence register of the various ORC chains. These included (1) using
Hg-binding sites in ethyl mercuric phosphate-derivatized crystals as fiducials for
cysteines and histidines, (2) using sulphur sites in native S-SAD data sets to verify
the location of a subset of cysteines and methionines (the weak signal present in
these data precluded the use of this information for phasing) and (3) conducting
Orc3-Orc6 crosslinking experiments to confirm the register of the Orc6 C-terminal
helix (Extended Data Fig. 4h). Hg and S sites were identified from
log-likelihood-gradient maps calculated using the MR-SAD option in PHASER. Additionally,
MR-SAD for sulphur sites also revealed the position of two ions with anomalous
scattering properties at the wavelength of data collection (λ = 1.7712 Å). These
ions showed clear density in experimental and 2Fo − Fc maps and were interpreted
as a chloride ion in the P-loop of Orc5 and as a potassium ion bound to Orc2.

Once a satisfactory apo-ORC model was obtained, it was used as a search model
for molecular replacement (using PHENIX-PHASER) to phase data collected
from ORC co-crystallized with ATPγS. The resulting solution (Z score = 86.7,
log-likelihood-gain = 7747) revealed clear (more than 2-4σ) positive difference
density in the nucleotide binding clefts of Orc1, Orc4 and Orc5 that could
accommodate ATPγS (Extended Data Fig. 7a-c). Since the resolution of these crystals
was limited to 4 Å, and since only small structural changes were observed
throughout the remainder of ORC, we refrained from building and refining a model against
this data set.

Structure analysis. Structural superpositions and docking into EM maps were
performed using UCSF Chimera. Buried surface area at domain/subunit
interfaces was calculated with PyMOL (PyMOL Molecular Graphics System, version
1.7.0.0, Schrödinger). Multiple protein sequence alignments were performed with
MAFFT and conservation scores were calculated and mapped onto the structure
with Consurf. Sequence logos were generated with WEBLOGO. Figures
were rendered using both PyMOL (The PyMOL Molecular Graphics System, version
1.7.0.0, Schrödinger) and UCSF Chimera.

Expression and purification of ORC1-5 and Orc6 for binding and crosslinking
studies. ORC containing subunits ORC1-5 was expressed in High5 cells using a
single virus with the multibac approach. Expression and purification were
performed as described above for expression of ORC1-6, except that (1) 1 mM
β-mercaptoethanol or 1 mM DTT was included in buffers during nickel- and
amylose-affinity steps or gel filtration chromatography, respectively, and (2) 10%
glycerol was maintained in all buffers throughout purification. For crosslinking
experiments, 6×His and MBP tags were removed from ORC1-5, whereas for
fluorescence anisotropy and pull-down experiments the affinity tags were left on
ORC1-5 by omitting the TEV protease cleavage and subsequent nickel-affinity steps.

The CTD of Orc6 (residues 187-257) was purified as described previously. Briefly,
6×His-tagged wild-type, A236E, A236C and Y225S Orc6-CTD
constructs were expressed in High5 cells and purified by nickel-affinity
chromatography in lysis buffer (50 mM Tris-HCl (pH 7.8), 600 mM KCl, 10% glycerol,
50 mM imidazole, 1 mM β-mercaptoethanol, 200 µM PMSF and 1 µg ml⁻¹
leupeptin). During the second of two wash steps, the salt concentration was decreased
to 300 mM KCl. Protein was eluted with 50 mM Tris-HCl (pH 7.8), 300 mM KCl,
10% glycerol, 250 mM imidazole and 1 mM β-mercaptoethanol, concentrated in
3K Amicon Ultra-15 concentrators (Millipore), and further purified on a HiPrep
16/60 Sephacryl S-200 HR column (GE Healthcare) equilibrated in 50 mM
Tris-HCl (pH 7.8), 300 mM KCl, 10% glycerol and 1 mM DTT. Peak fractions were
pooled, concentrated in 3K Amicon Ultra-15 concentrators (Millipore) and stored
at −80 °C.

Fluorescence anisotropy. Orc6 proteins were N-terminally labelled with Alexa
Fluor 488 5-SDP ester (Invitrogen Life Technologies) as described previously.
Binding reactions were performed for 30 min in a total volume of 140 µl
containing 30 nM fluorescently labelled wild-type, Y225S or A236E Orc6-CTD,
and different concentrations of ORC1-5 (ranging from 122 pM to
1 µM in twofold serial dilutions) in a buffer consisting of 50 mM Tris (pH 7.8),
300 mM KCl, 5% glycerol, 1 mM DTT and 0.1 mg ml⁻¹ BSA. Forty microlitres of
each reaction were transferred to 384-well plates in triplicate and fluorescence
polarization was measured in a POLARstar Omega plate reader (BMG Labtech)
with a 485 nm excitation filter and a 520 nm emission filter. For data analysis,
fluorescence polarization measurements were converted into fluorescence
anisotropy units, which ranged from 0.01 to 0.16. The mean fluorescence anisotropy
values obtained from three (for Y225S and A236E Orc6-CTD) or six (for
wild-type Orc6-CTD) independent experiments were fitted to the quadratic
binding equation:

$$A = A_f + (A_b - A_f)\,\frac{[L] + K_D + [R] - \sqrt{([L] + K_D + [R])^2 - 4[L][R]}}{2[L]}$$

where $A_f$ and $A_b$ are the measured anisotropy of free and bound, fluorescently
labelled Orc6, respectively, $[L]$ is the concentration of Orc6 used in binding
assays, $[R]$ is the concentration of ORC1-5 and $K_D$ is the apparent dissociation
constant. Owing to the limited sensitivity of our plate reader, binding experiments
were performed at Orc6 concentrations above the $K_D$ for wild-type Orc6 binding
to ORC1-5; hence, curve fits are meant to aid visualization rather than to model
the data explicitly. Mean and standard deviations of fluorescence anisotropy values
were plotted as a function of ORC1-5 concentration after subtracting $A_f$ values
from respective mean fluorescence anisotropy values for visual comparison
(Extended Data Fig. 4f).
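
The analysis steps described above can be sketched in a few lines, under stated
assumptions. The polarization-to-anisotropy conversion uses the standard relation
r = 2P/(3 − P); the dilution series reproduces the stated 1 µM to 122 pM range (14
twofold steps); and the binding function is the standard quadratic (ligand-depletion)
model with a 2[L] denominator. The K_D used in the demonstration is a hypothetical
value chosen for illustration only, and the free and bound anisotropies are simply the
ends of the reported 0.01 to 0.16 range, not fitted parameters.

```python
import math

def polarization_to_anisotropy(p):
    """Convert fluorescence polarization P to anisotropy r via r = 2P / (3 - P)."""
    return 2.0 * p / (3.0 - p)

def twofold_dilution_series(top_conc, n_points):
    """Concentrations of a twofold serial dilution, starting at `top_conc`."""
    return [top_conc / 2 ** i for i in range(n_points)]

def quadratic_binding(r_total, l_total, kd, a_free, a_bound):
    """Anisotropy predicted by the quadratic binding equation.

    r_total: total ORC1-5 concentration; l_total: total labelled Orc6 concentration;
    kd: apparent dissociation constant (all in the same concentration unit).
    """
    s = l_total + kd + r_total
    fraction_bound = (s - math.sqrt(s * s - 4.0 * l_total * r_total)) / (2.0 * l_total)
    return a_free + (a_bound - a_free) * fraction_bound

if __name__ == "__main__":
    orc_nM = twofold_dilution_series(1000.0, 14)   # 1 uM down to ~0.122 nM (122 pM)
    hypothetical_kd_nM = 5.0                       # assumption, NOT a reported value
    for conc in orc_nM:
        a = quadratic_binding(conc, 30.0, hypothetical_kd_nM, 0.01, 0.16)
        print(f"{conc:10.3f} nM  ->  anisotropy {a:.4f}")
```

Because the labelled Orc6 concentration (30 nM) exceeds the wild-type K_D, the curve in
this regime is dominated by titration of the probe rather than by the affinity itself,
which is consistent with the authors' caution that the fits serve as a visual guide.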

Crosslinking of Orc6 to ORC1-5. To independently check that the Orc6
C-terminal α-helix was oriented correctly with respect to Orc3, crosslinking studies
were performed with either wild-type or an A236C Orc6-CTD mutant and untagged
ORC1-5 using the homobifunctional maleimide crosslinker BMOE
(bis(maleimido)ethane; Thermo Scientific) under non-reducing conditions. Ala236 of Orc6
was chosen for mutation because inspection of the structure revealed that the insert
of Orc3 contained a nearby native cysteine (Cys372), which we reasoned could form
a crosslink with an A236C Orc6 mutant if our build register were correct (Extended
Data Fig. 4e). Before crosslinking, both wild-type and A236C Orc6-CTD
proteins, as well as untagged ORC1-5, were dialysed overnight into 50 mM PIPES
(pH 7.5), 300 mM KCl and 10% glycerol to remove reducing agents. Binding of the
mutant or wild-type Orc6 protein to ORC1-5 was performed in 50 µl reactions
containing 4 µM of each protein in 50 mM PIPES (pH 7.5), 300 mM KCl and 10%
glycerol for 30 min. BMOE was then added to reactions to a final concentration of
0.2 mM for 5 min, after which crosslinking reactions were stopped by adding
SDS-polyacrylamide gel electrophoresis (SDS-PAGE) sample-loading dye containing
β-mercaptoethanol (100 mM final concentration). Stopped reactions were analysed
by SDS-PAGE and Coomassie staining.

To ensure that Orc6 still bound to ORC1-5 under non-reducing conditions, we
performed binding reactions using ORC1-5 containing MBP-tagged Orc4 and either
wild-type or A236C Orc6-CTD under non-reducing conditions as described for the
crosslinking experiments, but instead subjected the reactions to pull-downs using
amylose beads (New England Biolabs). Beads were washed three times with 1 ml of
50 mM PIPES (pH 7.5), 300 mM KCl and 10% glycerol, after which bound proteins
were eluted with buffer containing 20 mM maltose and analysed by SDS-PAGE.
Electron microscopy. Four microlitres of a 30 nM Drosophila ORC solution con- 
taining N-terminally truncated Orcl, Orc2 and Orc3 subunits (in 20 mM Tris 
(pH7.8), 125 mM potassium glutamate, 5 mM MgCls, 1 mM ATPYS) were spotted 
onto glow-discharged, continuous-carbon film EM grids, stained with four drops 
of 2% uranyl formate for 10 s each and blotted. Grids were imaged in a Tecnai T12 
BIOTWINN transmission electron microscope operated at 120 keV with a LaBg 
cathodeas electron source. Data collection, image processing and two-dimensional 
classification were performed as described previously"*, and two-dimensional class 
averages were compared with class averages of the full-length ATPyS-ORC data set 
reported in ref. 14 (Extended Data Fig. 1c). 
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Extended Data Figure 1 | Deletion of variable N-terminal extensions in 
Orcl, Orc2 and Orc3 alters neither ORC stability nor overall ORC 
architecture. a, Gel-filtration chromatography trace of the ORC core used for 
crystallography, with (b) SDS-PAGE of respective ORC peak fractions from 
a, indicates the formation of a stable hexameric complex. ¢, Full-length ORC 
and ORC containing N-terminal truncations display similar structural features 
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in two-dimensional EM class averages. Both complexes were imaged by 
negative-stain EM in the presence of ATPYS. Note that although class averages 
from ORC with truncated Orc1-3 subunits contain full-length Orc6, Orcé is 
not visible owing to its flexible nature'*. Class averages for full-length ORC 
are derived from a data set used in ref. 14.
Extended Data Figure 2 | Experimental electron density contoured at 1σ for
different regions of the ORC structure. The Orc1 WH domain is shown in a, the
Orc3 insertion in b and the Orc4 AAA+ domain in c.
Extended Data Figure 3 | Comparison of individual ORC subunits and
archaeal Orc structures. Structures of individual ORC subunits are compared
to S. solfataricus Orc1-1 (PDB accession number 2QBY chain A) and
A. pernix Orc2 (PDB accession numbers 1W5S chain A (left) and 1W5T chain
C (right)). Different structural elements are coloured as indicated. The
initiator-specific motif (ISM) of the AAA+ ATPase fold is shown in the
inset. No electron density was observed for the region linking the AAA+ and
WH domains of Orc5 (indicated by a dashed line). The very N-terminal region
of Orc2, which could only be built as stretches of polyalanine, is not shown.
Extended Data Figure 4 | The Orc3 domain insertion forms a conserved,
hydrophobic binding platform for Orc6. a, Surface representation of ORC.
The Orc3 insertion, which extends from the Orc3 AAA+ lid subdomain and
interacts with the C-terminal helix of Orc6, is boxed. b, Secondary structure
representation of the boxed region shown in a. The Orc3 insertion forms a
bi-lobed, α-helical fold, three helices of which create a binding site for Orc6.
c, Surface conservation of the Orc3 insertion. Conserved Orc3 residues
cluster in the region that interacts with the Orc3 lid and in the Orc6 binding
pocket. The latter region contacts highly conserved residues in Orc6 (Y225 and
W228). d, Close-up view of Orc3•Orc6 interactions, showing a primarily
hydrophobic binding site in Orc3 for Orc6 residues (Y225, W228, M232,
A236). Y225, which in Drosophila Orc6 is equivalent to an amino acid altered in
a subset of patients with Meier-Gorlin syndrome, appears positioned within
hydrogen-bonding distance of E354 in Orc3 (dashed line). Colours are as in b.
e-h, Biochemical validation of the binding register for Drosophila Orc6.

e, Close-up of the Orc6•Orc3 interface. Orc6-Ala236 faces a hydrophobic
surface formed by Orc3 residues and is also in close proximity to a natural
cysteine in Orc3 (Cys372). To validate the register of the short C-terminal Orc6
helix and the Orc6•Orc3 interface, we mutated Orc6-Ala236 to either
glutamate, which we hypothesized would impede binding to ORC1-5 because
of clashes with hydrophobic residues in Orc3, or to cysteine, which we
presumed would not affect Orc3 binding but would allow site-specific
crosslinking to Orc3-Cys372. f, Orc6^A236E has a reduced affinity for the
ORC1-5 complex. The C-terminal domains (CTDs) of wild-type (WT) Orc6,
Orc6^A236E or the Meier-Gorlin syndrome equivalent Orc6^Y225S were each
N-terminally labelled with Alexa Fluor 488 and tested for ORC1-5 binding
using fluorescence anisotropy. As shown previously", the C-terminal domain 


of Orc6 binds ORC1-5 with low nanomolar affinity, whereas the Y225S
mutation strongly reduces binding. As predicted on the basis of the structure of
the Orc6•Orc3 interface, the A236E mutation also reduces the affinity of the
Orc6-CTD for ORC1-5. Mean and standard deviations from three (for
Orc6^Y225S and Orc6^A236E) or six (for wild-type Orc6) independent experiments
are shown. g, Orc6^A236C is able to bind to the ORC1-5 complex. Orc6-CTD^WT
or Orc6-CTD^A236C were incubated with ORC1-5 (containing MBP-tagged
Orc4) and subjected to pull-down experiments using amylose resin. Both
Orc6-CTD^WT and Orc6-CTD^A236C co-purified with ORC1-5. The pull-down
experiment was performed under non-reducing experimental conditions
similar to the crosslinking experiment in h. Asterisks mark two likely
proteolytic fragments of Orc3. h, The Orc6-CTD^A236C mutant, but not
the wild-type Orc6-CTD, specifically crosslinks to Orc3 within the ORC1-5
complex. Orc6-CTD^WT or Orc6-CTD^A236C, either alone or in the presence of
ORC1-5, was incubated with a bifunctional maleimide crosslinker and the
proteins subsequently analysed by SDS-PAGE. In reactions containing
ORC1-5 and Orc6-CTD^A236C, crosslinking gives rise to a novel band with
higher molecular mass than Orc3; the appearance of this band correlates with
a decrease in the amount of uncrosslinked Orc3 and Orc6-CTD, and does
not appear with reactions containing ORC1-5 and wild-type Orc6-CTD,
indicating that this species corresponds to an Orc3-Orc6 crosslink (a
moderately strong higher molecular-mass band that appears in the absence
of Orc6 probably corresponds to homotypic adducts between exposed cysteines
in Orc3). These results are consistent with the structure, which places
Orc6-Ala236 in close proximity to Orc3-Cys372. Note that ORC1-5 contained
MBP-tagged Orc4 in g but that the tag was removed in h.
Extended Data Figure 5 | ATP-binding site configuration at the Orc4•Orc5
and Orc5•Orc3 interfaces. a, Inter-AAA+ interactions between Orc4 and
Orc5 are similar to canonical AAA+ interactions between DnaA protomers 
(top panel, only Orc4 is used for superpositioning onto the left (light grey) 
AAA+ domain of an ATP-bound DnaA dimer, PDB accession number 
2HCB”). Close-up views of the nucleotide-binding site are shown for Orc4 
(bottom panel) and for DnaA for comparison (middle panel). The resemblance 
of the Orc4 nucleotide-binding pocket to the active site of functional AAA+ 
ATPases is somewhat surprising considering that mutations in the active site of 
Drosophila and human Orc4 have no reported effect on the ATPase activity of 
ORC as measured in vitro”*”°, but may help explain why a Drosophila ORC 
mutant bearing a Walker A or B substitution in Orc4 exhibits modest DNA 
replication defects in extracts”*. b, The putative arginine finger in Orc5 is well 
conserved across homologues. A sequence logo of a multiple sequence 
alignment of the region containing the putative arginine finger (marked with an
arrow) in eukaryotic Orc5 protein sequences is shown. Amino-acid
numberings correspond to the Drosophila Orc5 sequence. c, A potential Sensor 
II equivalent arginine (marked with an arrow) in the Orc4 Walker A motif is 
conserved in eukaryotic Orc4 homologues. A sequence logo of the Walker 

A motif from a multiple sequence alignment of eukaryotic Orc4 protein 
sequences is shown. Amino-acid positions are numbered as in Drosophila Orc4. 
d, Inter-AAA+ interactions between Orc5 and Orc3. The top panel shows a 
superposition derived from placing the AAA+ domain of Orc5 atop the 
AAA+ domain of the left (dark grey) protomer of an ATP-bound DnaA dimer; 
the bottom panel shows a close-up view of the nucleotide-binding site at the 
Orc5•Orc3 interface. Side chains of conserved residues known to be involved in
nucleotide binding and hydrolysis in AAA+ ATPases are represented as 
sticks in both a and d. WA, Walker A; WB, Walker B; SI, Sensor I; SII, Sensor II; 
RF, arginine finger.
Extended Data Figure 6 | The conformation of Orc1 arises from a
reorientation between its AAA+ and WH domains, not from changes
within the AAA+ ATPase domain itself. a, Superpositioning of the WH
domains of Orc1 and S. solfataricus Orc1-1 (PDB accession number 2QBY
chain A) reveals different conformations for both proteins, resulting from a
large domain rotation of the Orc1 AAA+ domain around a pivot point in
the linker preceding its WH domain. b, The Orc1 conformation is most similar
to a state seen for A. pernix Orc2 (PDB accession number 1W5T chain C).
The WH domains of both proteins were superposed as in a. c-e, Superposing
the AAA+ base subdomains of Orc1 and S. solfataricus Orc1-1 (c, PDB
accession number 2QBY chain A), A. pernix Orc2 (d, PDB accession number
1W5T chain C) and Orc3, Orc4 or Orc5 (e) shows that the typical AAA+




configuration between the base and lid subdomains is maintained in Orc1.
Only a slight opening of the nucleotide-binding cleft is observed in Orc1, which
is probably caused by the absence of bound nucleotide. f, g, The most
C-terminal α-helix of the Orc1 WH domain mediates interactions with the
Orc1 lid subdomain. An overview of the interaction is shown in f, with a
close-up view of contacts between a conserved tyrosine (Tyr915) in the
C-terminal Orc1 helix and a hydrophobic pocket of the Orc1 lid depicted in g.
h, The tyrosine in the C-terminal helix of Orc1 is well conserved across
metazoan but not fungal Orc1 homologues. Alignments are shown as sequence
logos. The numbering of amino acids is based on Drosophila Orc1, and the
tyrosine is marked by an arrow.
Extended Data Figure 7 | Nucleotide binding by Orc1, Orc4 and Orc5. For
a-c, molecular replacement with the apo-ORC model was used to phase
diffraction data collected from an ORC-ATPγS co-crystal. Positive Fo − Fc
difference density contoured at different sigma levels reveals clear features for
nucleotide binding to the AAA+ domains of (a) Orc1, (b) Orc4 and (c) Orc5.
ATPγS is docked into the difference density for reference; owing to the
moderate (4.0 Å) resolution of the data, this structure was not refined.
d, Modelling of canonical AAA+ interactions between Orc1 and Orc4,
generated using the Orc4•Orc5 interaction as a reference. Upper panel:
structural overview of modelled AAA+ domain positioning between Orc1 and
Orc4. Lower panel: close-up of the modelled Orc1•Orc4 ATPase site. Side
chains (taken from their place in the apo-ORC model as a reference) are shown
for conserved catalytically important residues.
Extended Data Figure 8 | Comparison of crystallographic and EM models.
Docking of the observed and remodelled ORC structures into the cryo-EM
density of S. cerevisiae and Drosophila ORC indicates that the ATPase domain
of Orc1 is repositioned into a canonical AAA+/AAA+ interaction with
Orc4 when Cdc6 is present, and supports a model where DNA passes through
the central channel in ORC. a, The three-dimensional EM volume for
S. cerevisiae ORC (as present in a complex with Cdc6, Cdt1 and MCM2-7 and
assembled in the presence of DNA, EMD-5625 (ref. 24)) contains Orc1 in the
activated conformation. ORC with Orc1 in the autoinhibited conformation
(left panel, as observed in the crystal structure) and remodelled conformation
(right panel, remodelled) were docked into the ORC•Cdc6•Cdt1•MCM2-7
cryo-EM map (only the density for ORC•Cdc6 is shown). The ORC•Cdc6 EM
density readily accommodates Orc1 in the activated conformation, but not in
its autoinhibited state. The EM density corresponding to Cdc6 is indicated
in the right panel. b, DNA passes through the central ORC channel in the
DNA•ORC•Cdc6 complex. ORC (with Orc1 in the remodelled conformation)
was first docked into the cryo-EM map derived from a DNA•ORC•Cdc6
complex (EMD-5381 (ref. 23)). The AAA+ domain and its associated DNA
from either S. solfataricus Orc1-1 or Orc1-3 (PDB accession number 2QBY)
were then superposed using the AAA+ domain of Orc4 as a guide. Both
dockings indicate that a region of density previously assigned to the Orc6




subunit actually corresponds to the DNA duplex. Although superpositioning
of the AAA+ domain of S. solfataricus Orc1-3 onto Orc4 better positions
duplex DNA in the observed EM density (than does the comparable exercise
using the Orc1-1•DNA complex), the curvature of the DNA (as present in the
Orc1-3•DNA co-crystal structure) results in a greater number of clashes
between DNA and ORC subunits. Nevertheless, both docking scenarios are
consistent with a DNA binding mode of ORC where DNA runs through the
central channel. Note that the handedness of the EM map (EMD-5381 (ref. 23))
has been corrected in this figure because it has been reported that the
original handedness was inverted. For clarity, the WH domain of Orc2 is
omitted from the remodelled ORC structure in b. c, The autoinhibited ORC
conformation observed in the crystal is, unlike the remodelled Orc1
configuration, similar to the ORC conformation observed in Drosophila ORC
EM reconstructions. Docking of the ORC crystal structure (top panels) or
the remodelled activated ORC structure (bottom panels) into a prior
three-dimensional EM reconstruction of Drosophila ORC (EMD-2479) reveals
excellent agreement between EM and crystal structures, but not between
EM and modelled activated ORC structures. The poor fit of the remodelled
Orc1 conformation into the EM density suggests that the EM structure
represents the autoinhibited state of ORC as seen in the crystal, indicating it is
the predominant state in solution. See also Supplementary Video 2.
Extended Data Figure 9 | Docking of the ORC structure into the cryo-EM
structure of an S. cerevisiae replication initiation intermediate indicates
that ORC recruits the MCM2-7 complex by binding to the ORC WH
domains. A prior model for ORC•MCM2-7 engagement, proposed from an
ORC•Cdc6•Cdt1•MCM2-7 cryo-EM structure generated in the presence of
DNA (shown in a, EMD-5625 (ref. 24)), used the crystal structure of replication
factor C (RFC) bound to the sliding clamp PCNA (shown in b, PDB accession
number 1SXJ) to suggest that the AAA+ domains of ORC engage the
MCM2-7•Cdt1 complex. However, using the handedness of the EM volume as
reported, this organization of ORC subunits leads to an inverted ATPase
site assembly, requiring that the Orc4 arginine finger (which is known to
stimulate Orc1 ATP hydrolysis) points towards the Orc5 nucleotide-binding
site rather than the appropriate Orc1 active site. Schematics for the ATP site
assemblies of ORC and RFC derived from these structures are shown in the
lower panels in a and b. The location of the WH domain collar of ORC and the
C-terminal collar of RFC is indicated by a grey circle. c, Docking of the
ORC crystal structure (with Orc1 in its remodelled or 'activated' conformation)
into the cryo-EM map shown in a reveals that the WH domains of ORC face an
MCM2-7 complex. This switched polarity of WH domains and AAA+
domains in the EM map corrects the ATPase site assembly and is schematized
in the right panel.
Extended Data Table 1 | Summary of data collection, phasing and refinement statistics


                                   Native                  GdCl3 soak               Native - ATPγS
Data collection:
  Beamline                         APS 23ID-B              APS 23ID-B               NSLS X25
  Wavelength (Å)                   1.0332                  1.71083                  1.1
  Resolution range (Å)             48.8-3.5 (3.59-3.5)     50.23-3.96 (4.13-3.96)   58.11-4.0 (4.16-4.0)
  Space group                      I222                    I222                     I222
  Unit cell dimensions
    a, b, c (Å)                    145.5, 259.0, 257.0     145.3, 257.7, 256.1      144.2, 255.3, 261.1
    α, β, γ (°)                    90, 90, 90              90, 90, 90               90, 90, 90
  Reflections
    Total                          518,695 (38,421)        2,359,387 (209,066)      554,356 (62,459)
    Unique                         61,418 (4,491)          40,955 (3,602)           41,025 (4,568)
  Completeness (%)                 100.0 (100.0)           97.3 (76.2)              100.0 (100.0)
  Multiplicity                     8.4 (8.6)               57.6 (58.0)              13.5 (13.7)
  Mean I/σ                         7.7 (1.1)               23.4 (1.9)               16.6 (1.3)
  CC1/2                            0.995 (0.427)           0.999 (0.745)            0.999 (0.678)
  Rmerge                           0.214 (2.181)           0.355 (3.857)            0.095 (2.174)
  Rpim                             0.078 (0.788)           0.047 (0.509)            0.027 (0.606)
  Wilson B factor                  118.6                   165                      197.8

Phasing:
  FOM before density modification                          0.27
  FOM after density modification                           0.64

Refinement:
  Rwork/Rfree                      0.2243/0.2581
  Number of atoms                  16336
    Protein
    Ligands
  B factors
    Mean
    Macromolecule
    Ligands
  Root mean square deviation
    Bond lengths (Å)
    Bond angles (°)
  Ramachandran plot
    % favored
    % allowed
    % outliers
  Molprobity
    Clashscore

Highest resolution shell is shown in parentheses.


A dusty, normal galaxy in the epoch of reionization


Darach Watson, Lise Christensen, Kirsten Kraiberg Knudsen, Johan Richard, Anna Gallazzi & Michal Jerzy Michalowski


Candidates for the modest galaxies that formed most of the stars in 
the early Universe, at redshifts z > 7, have been found in large num- 
bers with extremely deep restframe-ultraviolet imaging’. But it has 
proved difficult for existing spectrographs to characterize them using 
their ultraviolet light’*. The detailed properties of these galaxies 
could be measured from dust and cool gas emission at far-infrared 
wavelengths if the galaxies have become sufficiently enriched in dust 
and metals. So far, however, the most distant galaxy discovered via 
its ultraviolet emission and subsequently detected in dust emission is 
only at z = 3.2 (ref. 5), and recent results have cast doubt on whether 
dust and molecules can be found in typical galaxies at z= 7°*. Here 
we report thermal dust emission from an archetypal early Universe 
star-forming galaxy, A1689-zD1. We detect its stellar continuum in
spectroscopy and determine its redshift to be z = 7.5 ± 0.2 from a
spectroscopic detection of the Lyman-α break. A1689-zD1 is
representative of the star-forming population during the epoch of
reionization’, with a total star-formation rate of about 12 solar masses 
per year. The galaxy is highly evolved: it has a large stellar mass and 
is heavily enriched in dust, with a dust-to-gas ratio close to that of 
the Milky Way. Dusty, evolved galaxies are thus present among the 
fainter star-forming population at z> 7. 


Figure 1 | The gravitationally lensing galaxy cluster Abell 1689. The colour
image is composed with Hubble Space Telescope filters: F105W (blue), F125W
(green) and F160W (red). The zoomed box (4″ × 4″) shows A1689-zD1.
Contours indicate far-infrared dust emission detected by ALMA at 3σ, 4σ
and 5σ local rms (yellow, positive; white, negative). The ALMA beam


As part of a programme to investigate galaxies at z>7 with the 
X-shooter spectrograph on the Very Large Telescope, we observed the 
candidate high-redshift galaxy, A1689-zD1, behind the lensing galaxy 
cluster Abell 1689 (Fig. 1). The source was originally identified'® as a can- 
didate z > 7 system from deep imaging with the Hubble and Spitzer space 
telescopes, with photometry fitting suggesting that it is at z = 7.6 ± 0.4.
The galaxy is gravitationally magnified by a factor of 9.3 by the galaxy 
cluster’®. Although it is intrinsically faint, because of the gravitational 
amplification, it is one of the brightest candidate z > 7 galaxies known. 
The X-shooter observations were carried out on several nights between 
March 2010 and March 2012 with a total time of 16h on target. 

The galaxy continuum is detected and can be seen in the binned spectrum
(Fig. 2). The Lyα cutoff is at 1,035 ± 24 nm and defines the redshift
to be z = 7.5 ± 0.2. It is thus one of the most distant galaxies known
so far to be confirmed via spectroscopy, and the only galaxy at z > 7
where the redshift is determined from spectroscopy of its stellar
continuum. The spectral slope is blue; using a power-law fit F_λ ∝ λ^(−β),
β = 2.0 ± 0.1, where λ is the wavelength and F_λ is the flux per unit
wavelength. The flux break is sharp, and greater than a factor of ten in depth.
In addition, no line emission is detected, ruling out a different redshift
solution for the galaxy. Line emission is excluded to lensing-corrected


(1.36″ × 1.15″) is shown, bottom left. Images and noise maps were
primary-beam corrected before making the signal-to-noise ratio (SNR) maps.
Slit positions for the first set of X-shooter spectroscopy are overlaid in magenta
(dashed boxes indicate the dither), while the parallactic angle was used in
the remaining observations (pink dashed lines).
Figure 2 | Spectrum of A1689-zD1. The binned one-dimensional (middle
panel) and two-dimensional (upper panel; wavelength versus distance along
the slit) spectra are shown, with the 68% confidence uncertainty for the
one-dimensional spectrum in the bottom panel. The redshift z = 7.5 is
determined from the Lyα break at 1,035 nm. Sky absorption (grey bands) and


depths of 3 × 10^−19 erg cm^−2 s^−1 (3σ) in the absence of sky emission
lines, making this by far the deepest intrinsic spectrum published of 
an object from the epoch of reionization, highlighting the difficulty of 
obtaining ultraviolet redshifts for objects at this epoch that are not strongly 
dominated by emission lines. The restframe equivalent width limits 
are <4 Å for both Lyα and C III] 1,909 Å. Our search space for Lyα is
largely free of sky emission lines; they cover 16% of the range. 

Fits to the galaxy's spectral energy distribution (SED) yield a
lensing-corrected stellar mass of 1.7 × 10^9 solar masses (M☉) (that is,
log(M*/M☉) = 9.23) with a best-fitting stellar age of 80 million years
(that is, a light-weighted age t of log[t (yr)] = 7.91). The
lensing-corrected ultraviolet luminosity is about 1.8 × 10^10 L☉, where L☉ is
the solar luminosity, resulting in a star-formation rate (SFR) estimate of
2.7 ± 0.3 M☉ yr^−1 based on the ultraviolet emission and uncorrected
for dust extinction, for a Chabrier initial mass function. A1689-zD1 is
thus fainter than the turnover luminosity, L*, in the galaxy luminosity

Figure 3 | ALMA SNR maps of A1689-zD1. Contours are SNR = 5, 4, 3, 2
(black, solid), -3, -2 (white, dashed). Images and noise maps were
primary-beam corrected before making SNR maps. Beam sizes are shown at the
bottom left of each panel. Panels are 8″ × 8″. The panels show from left to right:
the combined data, the two tunings of observation 2011.0.00319.S and
the best-fit SED (blue line) are shown. The Lyα break is close to the
spectrograph's near-infrared (NIR)/visual (VIS) arm split; however, the break is
clearly detected in the NIR arm alone. A nearby galaxy (z ~ 2) visible in the
bottom part of the two-dimensional spectrum is detected in both the VIS
and the NIR arms.


function at this redshift, meaning that it is among the faint galaxies that 
dominate star formation at this epoch’. 

Mosaic observations of the lensing cluster were obtained with the
Atacama Large Millimetre Array (ALMA) in Cycles 0 and 1 with the
receivers tuned to four 3.8 GHz frequency bands between 211 GHz and
241 GHz. A1689-zD1 is located towards the northern edge of the mosaic
and is detected at 5.0σ with an observed flux of 0.61 ± 0.12 mJy in the
combined image and at 2.4-3.1σ significance in each of the three
individual observations (Fig. 3). A1689-zD1 is located within the primary
beam's full-width at half-maximum (FWHM) of one pointing and the
sensitivity (root mean square, rms) around its position is 0.12 mJy per
beam, 42% of the sensitivity of the deepest part of the mosaic. The source
is the brightest in the mosaic area of 5 square arcminutes. It coincides
with the ultraviolet position of A1689-zD1 and is 1.5″ away from the
next-nearest object in the Hubble Space Telescope image, which is not
detected in the ALMA map. No line emission is convincingly detected

observation 2012.1.00261.S. A1689-zD1 is detected from left to right, at 5.0σ,
2.4σ, 3.1σ and 3.0σ. Natural weighting was used and the visibilities were
tapered with a 1″ circular Gaussian kernel, resulting in beams of
1.36″ × 1.15″, 1.19″ × 1.09″, 1.43″ × 1.12″ and 1.43″ × 1.17″ from left
to right.
Table 1 | Comparison of A1689-zD1 to other high-z star-forming galaxies


Galaxy name    Redshift, z    Stellar mass, M*    SFR_UV        SFR_Lyα       SFR_IR        Dust mass, M_D
                              (10^9 M☉)           (M☉ yr^−1)    (M☉ yr^−1)    (M☉ yr^−1)    (10^7 M☉)
HFLSS3 (ref. 21) 6.34 50+ 30° 1.3+0.4* = 1,300_s20t 30_iot 
HCMEA (ref. 22) 6.56 —_ 942 2 <28 (ref. 26) <10 (ref. 26) 
Himiko (ref. 8) 6.60 152 30+2 3521 <8 <4.72 (ref. 26) 
A1703-zD1 (ref. 23) 6.8 0.7-1.5 73203 = <16 (ref. 26) <5,7 (ref. 26) 
IOK-1 (ref. 24) 6.96 <40 23.9 + 1.4 (ref. 27) 10+2 <10 (ref. 28) <6.4 (ref. 28) 
z8-GND-5296 (ref. 2) 7.51 1: nee 330+71° = <127 (ref. 26) <50 (ref. 26) 
HG090423 (ref. 25) 8.2 <0.05 (ref. 29) <0.38 (ref. 30) —_— <5 (ref. 29) <2t 
A1689-zD1 75 17+ 32 2.7 + 0.3 <0.7 9th 43 


The SFRs are derived from extinction-uncorrected ultraviolet emission, Lyα emission and far-infrared emission, respectively.


* Derived from the Hubble Space Telescope F160W photometry and corrected for lensing.
† 95% lower bound only.
‡ Assuming the same dust parameters assumed for A1689-zD1.


in the ALMA spectral data, which covers about half of the frequency
range in which we could expect to find the [C II] 158 μm line, based on
the range of redshifts allowed by the X-shooter Lyα break detection
(see Methods). The derived total infrared luminosity, corrected
for lensing and for cosmic microwave background (CMB) effects, is
6.2 × 10^10 L☉, corresponding to a SFR of about 9 M☉ yr^−1, with a dust
mass in the galaxy of 4 × 10^7 M☉, assuming a dust temperature of 35 K.
The dust mass we determine is conservative and is unlikely to be lower
than about 2 × 10^7 M☉. It could in principle be larger by an order of
magnitude, owing to temperature and geometry effects (see Methods).
Table 1 presents a comparison of the properties of A1689-zD1 with
other z > 6.5 galaxies with sensitive millimetre-wavelength
observations. Most of the sources observed so far are more ultraviolet-luminous
than A1689-zD1 and have upper limits somewhat above the dust mass
found here.
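
As a quick arithmetic check on the ALMA measurement quoted earlier (0.61 mJy observed flux, 0.12 mJy per beam local rms, magnification 9.3), the following sketch computes the detection significance and the lensing-corrected flux. The function and variable names are illustrative and not taken from the paper; the small difference from the quoted 5.0σ comes from using the rounded published values.

```python
def detection_summary(flux_mjy, rms_mjy, magnification):
    """Return (signal-to-noise ratio, lensing-corrected flux in mJy)."""
    snr = flux_mjy / rms_mjy                # detection significance
    intrinsic_mjy = flux_mjy / magnification  # flux with the lens magnification removed
    return snr, intrinsic_mjy


# Values quoted in the text for the combined ALMA image of A1689-zD1.
snr, intrinsic_mjy = detection_summary(flux_mjy=0.61, rms_mjy=0.12, magnification=9.3)
print(f"SNR ~ {snr:.1f}; intrinsic flux ~ {intrinsic_mjy * 1000:.0f} microJy")
```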

Far-infrared emission requires the production of metals in a galaxy, 
whether in the solid phase as dust, or as ionized gas. To be detected, such 
a galaxy must have enriched its interstellar media with metals and dust. 
The metals are primarily produced and distributed via supernova explo- 
sions and so metal enrichment happens concurrently with massive 
star formation, but the site of dust production is less certain. However, 
the mechanism must be very rapid’ and these observations of A1689- 
zD1 place the strongest direct constraints so far on the rapidity of dust 
enrichment, occurring within only 500 million years of the beginning 
of star formation in the Universe. 

We can derive an approximate gas mass for this galaxy by inverting
the Schmidt-Kennicutt law based on its size and SFR, and we deduce
a gas mass of 2 × 10^9 M☉. This gives a dust-to-gas mass ratio of about
17 × 10^−3. And while the uncertainty on the gas and dust masses is
large (approximately 0.5 dex, dominated by the scatter in the
Schmidt-Kennicutt law and the unknown dust temperature, where the two values
are linked through the SFR), the dust-to-gas mass ratio is nevertheless
high for this redshift, between a half and a few times the Milky Way
value. For a constant dust-to-metals ratio, this also suggests a high
metallicity with a similar uncertainty. Such a high metallicity is
consistent with the fundamental mass-metallicity-SFR relation for star-forming
galaxies, from which we determine an expected elemental abundance
of about half the solar value. Instead of a young, dust-poor galaxy, these
measurements suggest an evolved system. Finally, the deep upper limit
on the C III] 1,909 Å emission line, of <4 Å restframe equivalent width,
is unusual for line-emitting galaxies at these redshifts. From galaxies
at lower redshift, we might expect an equivalent width as high as 30 Å
for this SFR for a young galaxy. And while the lack of Lyα emission in
this galaxy could be explained by absorption in the intergalactic medium,
the absence of C III] emission cannot, and is consistent with a more
evolved galaxy. The surface area we are observing in ultraviolet
emission is only approximately 1.5 kpc^2 (lensing-corrected size), so the SFR
per unit area is high, approximately 8 M☉ yr^−1 kpc^−2, comparable to
vigorous starburst galaxies with similar SFRs, but an order of
magnitude or more lower per unit surface area than the extreme starbursts
found in high-redshift submillimetre surveys or quasars.


The comparison of the gas mass to stellar mass in this galaxy shows 
that about 55 + 25% of the baryonic matter is in the form of gas, indi- 
cating that the galaxy has already formed much of its stars and metals. 
Taken together, these lines of evidence point to a picture of A1689-zD1 
consistently forming stars at a moderate rate since z ~ 9, or possibly 
having passed through its extreme starburst very rapidly and now being 
in a declining phase of star formation. 
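
The derived quantities discussed above follow from simple ratios of the quoted central values. The sketch below reproduces them under stated assumptions: it uses only rounded central values (ultraviolet and infrared SFRs of 2.7 and 9 M☉ yr^−1, an area of 1.5 kpc^2, a stellar mass of 1.7 × 10^9 M☉, a gas mass of roughly 2 × 10^9 M☉ and a dust mass of 4 × 10^7 M☉), ignores all uncertainties, and uses hypothetical names throughout. Because the inputs are rounded, the results only approximately match the published figures.

```python
def derived_quantities(sfr_uv, sfr_ir, area_kpc2, m_star, m_gas, m_dust):
    """Combine quoted central values into the summary ratios used in the text."""
    sfr_total = sfr_uv + sfr_ir                 # unobscured + dust-obscured star formation
    sfr_surface_density = sfr_total / area_kpc2  # Msun / yr / kpc^2
    gas_fraction = m_gas / (m_gas + m_star)     # share of baryons (gas + stars) held in gas
    dust_to_gas = m_dust / m_gas
    return sfr_total, sfr_surface_density, gas_fraction, dust_to_gas


sfr_total, sigma_sfr, f_gas, dtg = derived_quantities(
    sfr_uv=2.7, sfr_ir=9.0, area_kpc2=1.5,
    m_star=1.7e9, m_gas=2e9, m_dust=4e7,
)
print(f"total SFR ~ {sfr_total:.1f} Msun/yr")
print(f"SFR surface density ~ {sigma_sfr:.1f} Msun/yr/kpc^2")
print(f"gas fraction ~ {f_gas:.2f}")
print(f"dust-to-gas ratio ~ {dtg:.3f}")
```

With these inputs the total SFR comes out near 12 M☉ yr^−1, the surface density near 8 M☉ yr^−1 kpc^−2 and the gas fraction near 55%, in line with the values stated in the text.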

It has been suggested that the decreasing metal contents of high- 
redshift galaxies will make them challenging to detect at far-infrared 
wavelengths®’”. A1689-zD1 is at z = 7.5 and although it is magnified by 
a factor of 9.3, it was detected in only brief observations with ALMA. 
This promises a reasonable detection rate for L* galaxies in unlensed 
fields at these redshifts for the full ALMA array, in contrast to the gloomy 
outlook painted by observations of very-low-metallicity systems’*. The 
precise identification and detailed characterization of the star-forming 
population of the early Universe should therefore be possible in the 
far-infrared in the near future and should not be restricted to rare hyper- 
luminous infrared galaxies. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper.
METHODS


All uncertainties quoted are 68% confidence unless stated otherwise. 

Optical spectroscopy. The X-shooter spectrograph on the Very Large Telescope 
was used to observe the source A1689-zD1 on the nights 21 March and 18 April 2010, 
as well as shorter exposures on 27 March, 25 April, 1, 2 and 3 May 2011, and 17 and
24 March 2012. The observations used a standard dither pattern. Approximately 
half the exposure time (the 2010 observations) was performed with a fixed slit posi- 
tion angle, while the rest used the parallactic angle. The slit setups are indicated in 
Fig. 1. The data were reduced with the ESO X-shooter pipeline version 2.2. The stan- 
dard calibration routines were used, with the different nod positions employed to 
subtract the background sky emission. Spectrophotometric standard stars observed 
on the same night as the galaxy were used for flux calibration. The target was 
acquired using an offset from a bright nearby star, calculated from the Hubble 
Space Telescope imaging. Offsets are executed with an accuracy of <0.1″, that is,
the location of the galaxy in the slit is known a priori to better than a pixel. The slit- 
centre position and offsets along the slit were used to shift and co-add the data into 
a final combined two-dimensional spectrum. 

Spectroscopic redshift. The spectroscopy rules out a low-redshift solution. We 
detect no emission lines in the spectrum (see below), but the stellar continuum of 
the galaxy is detected, allowing us to measure the blue end of the ultraviolet
continuum and a sharp break, which cannot be reproduced with dust extinction. Other
breaks are excluded by the sharpness and depth of the break in spectroscopy, the 
blue slope of the ultraviolet continuum, and the deep upper limits in the VIS arm of 
the spectrograph. These data are independent of the original discovery data for the 
dropout, removing any Eddington-bias-like effect and improving the reliability of 
the redshift determination. 

The SNR per spectral pixel is low. We therefore bin the X-shooter spectra by a 
large factor in wavelength (see ref. 31 for details). We fit a template spectrum of a 
~100-million-year-old galaxy to estimate the redshift. This template was shifted to
a range of redshifts and corrected for the transmission in the intergalactic medium.
The best-fit redshift was found from χ² minimization. We tested the fit with a
variety of bin sizes: 150, 180, 200, 250 and 400 pixels in the near-infrared. The
VIS-spectrum data bin sizes are 1.5 times larger. We extracted the one-dimensional
spectrum for each binning and performed 1,000 realizations of the model where we
added Gaussian noise corresponding to the uncertainty in each binned pixel. We
calculated the average of the five binning sizes and determined the average 68%
confidence level to be z = 7.5 ± 0.2.
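
The link between the break wavelength and the quoted redshift can be illustrated with a much simpler calculation than the template fit described above. The sketch below is not the authors' procedure: it assumes the break sits exactly at the Lyα rest wavelength (121.567 nm), treats the 1,035 ± 24 nm measurement as Gaussian, and propagates the uncertainty with random realizations. All names are illustrative.

```python
import random
import statistics

LYA_REST_NM = 121.567  # rest-frame Lyman-alpha wavelength in nm


def redshift_from_break(observed_nm, rest_nm=LYA_REST_NM):
    """Redshift implied by an observed break wavelength: z = lambda_obs / lambda_rest - 1."""
    return observed_nm / rest_nm - 1.0


def monte_carlo_redshift(observed_nm, sigma_nm, n_realizations=1000, seed=0):
    """Mean and standard deviation of z over Gaussian realizations of the break wavelength."""
    rng = random.Random(seed)
    draws = [
        redshift_from_break(rng.gauss(observed_nm, sigma_nm))
        for _ in range(n_realizations)
    ]
    return statistics.mean(draws), statistics.stdev(draws)


z_central = redshift_from_break(1035.0)
z_mean, z_sigma = monte_carlo_redshift(1035.0, 24.0)
print(f"z from break = {z_central:.2f}; Monte Carlo: {z_mean:.2f} +/- {z_sigma:.2f}")
```

Under these assumptions the central value and scatter are close to the z = 7.5 ± 0.2 reported from the full analysis.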

To be confident of the location of the spectral break, we also adopt a standard 
methodology for detecting steps in one-dimensional data: searching for a change in 
slope in the unbinned cumulative sum (Extended Data Fig. 1). The break at ~1,035 nm 
is clearly observed only in the data from the NIR arm of the spectrograph. We sim- 
ulated the data 500 times, using the error spectrum as the standard deviation for 
the Gaussian random realizations; following the same cumulative sum method and 
data-cleaning as for the real data, we derived the break position and its uncertainty. 
This is very close to the analysis of the binned data: z = 7.5 ± 0.2. The fit results and
error are independent of the VIS arm data and depend only on the NIR spectrum. 
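
The cumulative-sum break search described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the authors' code: the break is modelled as a single step between two constant flux levels, the spectrum is synthetic, and the names `find_step` and `monte_carlo_break` are hypothetical. A step in the data is a change of slope in the cumulative sum, so running sums give the least-squares step position directly.

```python
import random
from itertools import accumulate
from statistics import mean, stdev


def find_step(flux):
    """Return the index k of the best least-squares split into two constant levels.

    Minimizing the residual of a two-level step model is equivalent to
    maximizing S_k^2/k + (S_n - S_k)^2/(n - k), where S_k is the cumulative sum.
    """
    n = len(flux)
    csum = list(accumulate(flux))
    total = csum[-1]
    best_k, best_score = 1, float("-inf")
    for k in range(1, n):
        left = csum[k - 1]      # sum of flux[0:k]
        right = total - left    # sum of flux[k:n]
        score = left * left / k + right * right / (n - k)
        if score > best_score:
            best_k, best_score = k, score
    return best_k


def monte_carlo_break(flux, err, n_real=500, seed=1):
    """Perturb the spectrum with Gaussian noise and collect the break positions."""
    rng = random.Random(seed)
    positions = []
    for _ in range(n_real):
        trial = [f + rng.gauss(0.0, e) for f, e in zip(flux, err)]
        positions.append(find_step(trial))
    return mean(positions), stdev(positions)


# Synthetic example (hypothetical numbers): zero flux blueward of pixel 1200,
# unit flux redward of it, with a per-pixel noise of 1.
rng = random.Random(0)
n_pix, true_break = 2000, 1200
err = [1.0] * n_pix
flux = [(1.0 if i >= true_break else 0.0) + rng.gauss(0.0, 1.0) for i in range(n_pix)]
centre, spread = monte_carlo_break(flux, err, n_real=100)
```

The scatter of the recovered positions over the noise realizations plays the role of the break-position uncertainty; converting a pixel position to a redshift would additionally need the wavelength solution, which is not modelled here.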

A galaxy at z ~ 2 is reported offset by 1.5″ from the lensed galaxy. In Extended
Data Fig. 2 we show its undiluted spectrum from the 2010 data alone. Its spectrum 
extends across the NIR/VIS arm boundary and does not show the sharp break seen 
in A1689-zD1. In Fig. 2, the spectrum of this nearby galaxy is diluted because the 
spectra are observed at different slit orientations, some of which did not cover the 
companion (see Fig. 1). 

Finally, we fit the full set of photometric data (all Hubble and Spitzer bands) with
the New-HyperZ code, version 12.2, to determine a photometric redshift, for
completeness. Only one redshift solution is allowed, at z ~ 7.4 (Extended Data Fig. 3), 
consistent with the spectroscopic determination. 
Emission line flux detection limits. No emission line is found from visual inspection
of the spectrum. Of the strong emission lines, at z = 7.5 only Lyα could in
principle be detected (the He II 1,640 Å line would be too faint). We estimated the
limits for the detection of any emission line by adding artificial emission lines at
random within the redshift range determined from the spectral break redshift calculation.
These lines were simulated with FWHMs in the range 50–200 km s⁻¹ and
were added to the unbinned two-dimensional spectrum. The one-dimensional spectra 
were extracted and binned by 3-5 pixels. A Gaussian function was fitted to the data 
and the SNR of the artificial line flux was derived. 

We place a 3σ upper limit to the Lyα emission line of 1.8 × 10⁻¹⁷ erg cm⁻² s⁻¹
at 10,030 Å near the sky lines and a factor of seven lower away from the sky lines
(not corrected for lensing magnification). This limit becomes tighter from the
blue to the red, that is, at 10,330 Å and 10,573 Å the limits near the sky lines are
0.9 × 10⁻¹⁷ erg cm⁻² s⁻¹ and 0.5 × 10⁻¹⁷ erg cm⁻² s⁻¹, respectively.

To determine the effective Lyα escape fraction, we first convert this limit to a
nominal SFR by assuming a factor-of-8.7 flux ratio between Lyα and Hα, and a
standard conversion between Hα and the SFR, but assuming a Chabrier initial
mass function, as used throughout the paper. This corresponds to a detection limit
for the nominal Lyα SFR of <0.7 M⊙ yr⁻¹ (corrected for lensing). Between the sky
lines, the upper limit is <0.09 M⊙ yr⁻¹. This implies an effective Lyα escape fraction
of either <6% or <0.8% by comparison to the total SFR of 12 M⊙ yr⁻¹. The rest-frame
Lyα equivalent width is <27 Å, or <4 Å between the sky lines. We cannot
distinguish between Lyα absorbed by the galaxy interstellar medium, circumgalactic
medium or intergalactic medium. However, the dustiness of the galaxy indicates
that a substantial fraction of the Lyα may be absorbed in the host.
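
The escape-fraction limits follow from dividing the nominal Lyα SFR limits by the total SFR. The short check below uses only the numbers quoted in this paragraph; the function name `escape_fraction_limit` is a hypothetical helper, not something from the paper.

```python
# Values quoted in the text (SFRs in solar masses per year, lensing-corrected).
TOTAL_SFR = 12.0
LYA_SFR_LIMITS = {"near sky lines": 0.7, "between sky lines": 0.09}


def escape_fraction_limit(lya_sfr_limit, total_sfr):
    """Upper limit on the effective Lyα escape fraction (dimensionless)."""
    return lya_sfr_limit / total_sfr


limits = {
    label: escape_fraction_limit(sfr, TOTAL_SFR)
    for label, sfr in LYA_SFR_LIMITS.items()
}
for label, fraction in limits.items():
    print(f"{label}: f_esc < {100 * fraction:.2f}%")
```

The two ratios round to the <6% and <0.8% limits quoted above.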

Similarly, we place a 3σ upper limit of 2 × 10⁻¹⁸ erg cm⁻² s⁻¹ for the lensing-uncorrected
flux of the C III] 1,909 Å emission line, which lies in a region of the spectrum
relatively free of atmospheric effects, corresponding to a rest-frame equivalent width
of <4 Å.
The ALMA observations and data reduction. We obtained ALMA mosaic obser- 
vations for A1689 in Cycle 0 and Cycle 1 as part of the projects 2011.0.00319.S and 
2012.1.00261.S. The receivers were tuned to cover 211.06-214.94 GHz, 221.46- 
225.34 GHz, 227.06-230.94 GHz and 237.46-241.34 GHz; the 2012.1.00261.S data 
used here cover the first and third frequency setup. The correlator was used in the 
frequency domain mode with a bandwidth of 1,875 MHz in each spectral window. 
The projected baselines range between 12 m and 450 m. The quasar 3C 279 was 
used for bandpass and phase calibration. The distance from 3C 279 to A1689 is 5.9°. 
Mosaicked images were primary-beam-corrected and the weights of the individual 
fields were taken into account. Flux calibration was done using Mars and Titan. 

Data reduction used the Common Astronomy Software Applications (http://casa.nrao.edu/)
versions 3.4 and 4.1 for the Cycle 0 and 1 data respectively. For the
Cycle 0 data, additional careful reduction of the observatory-provided preliminary- 
reduced data was required, including flagging of noisy data as well as producing 
a frequency-dependent model of 3C 279 to account for the spectral index of the 
continuum emission through bootstrapping from the flux calibrators. Our results 
are, however, not more than 1σ different from the preliminary-reduced data.

The data were combined and imaged using CASA 4.2; when imaging, the calibrated
visibilities were naturally weighted (resulting beam size 0.8″ × 1″) and tapered
using a two-dimensional Gaussian with 1″ × 1″ to give greater weight to shorter
baselines, resulting in a beam-size of 1.36″ × 1.15″. The full mosaic will be presented
in a forthcoming paper (K.K. et al., manuscript in preparation). The SNR images are
shown in Fig. 3. The flux image for A1689-zD1 is shown in Extended Data Fig. 4. 

The most conservative estimate of the astrometric uncertainty is half the beam 
dimensions, that is, 0.35” X 0.5”, dominated by residual atmospheric phase effects 
and the SNR. Together with four other ALMA-detected sources, we found an aver- 
age offset of 0.4-0.45” between the ALMA centroids and the Hubble centroids. 
Part of these offsets is probably contributed by intrinsically different optical and 
far-infrared morphologies. No noticeable flux is detected from the galaxy 1.5” from 
A1689-zD1. 

No spectral line is detected towards A1689-zD1 in the ALMA data. Our data 
cover [C II] 158 µm emission at: z = 6.87–7.00, 7.23–7.37, 7.43–7.58 and 7.84–8.00.
This covers ~50% of the 1σ and 2σ ranges allowed by X-shooter. The typical observed
luminosity ratio L_[C II]/L_FIR varies between 0.001 and 0.008 (ref. 36). Over the redshift
ranges above, we exclude [C II] 158 µm emission at the high end of this ratio
(>0.0024, 5σ for a linewidth of 100 km s⁻¹).
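
The redshift windows quoted above can be reproduced from the four receiver tuning ranges given earlier in this section. The sketch below assumes a [C II] rest frequency of 1900.537 GHz, a value not stated in the text; `redshift_window` is a hypothetical helper name.

```python
# Assumed rest frequency of the [C II] 158 µm line (not quoted in the text).
NU_REST_CII_GHZ = 1900.537

# Receiver tuning ranges quoted in the ALMA observations paragraph (GHz).
TUNINGS_GHZ = [
    (211.06, 214.94),
    (221.46, 225.34),
    (227.06, 230.94),
    (237.46, 241.34),
]


def redshift_window(nu_low, nu_high, nu_rest=NU_REST_CII_GHZ):
    """Redshift range over which a line at nu_rest falls inside [nu_low, nu_high]."""
    # z = nu_rest / nu_obs - 1, so the highest frequency gives the lowest redshift.
    return nu_rest / nu_high - 1.0, nu_rest / nu_low - 1.0


windows = sorted(redshift_window(low, high) for low, high in TUNINGS_GHZ)
for z_min, z_max in windows:
    print(f"z = {z_min:.2f} to {z_max:.2f}")
```

Sorted by redshift, the windows come out in the same order as the list in the text, with the highest-frequency tuning covering the lowest redshifts.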

Code availability. The data reduction scripts used for the ALMA data are freely 
available and can be obtained at http://dark.nbi.ku.dk/research/archive. 

Dust mass and temperature. We derive the far-infrared luminosity (42–122 µm
restframe) and dust mass using a single temperature modified blackbody and 
assuming a typical value for the dust mass absorption coefficient of «, = 0.067 X 


B 
[a +z) xl " kg"! at the observed frequency v = 226 GHz, and fp = 1.92 


(as observed in the high-redshift galaxy HFLS3). Different values for the dust mass
absorption coefficient could result in dust masses differing by ±0.18 dex (ref. 38). We explore
the effect of using more complex full SED models in a section below. 

With only one far-infrared flux point it was necessary to assume a value of the 
dust temperature to recover the dust mass. We assumed 35 K. A lower temperature 
would increase the inferred dust mass, making the galaxy richer in dust than we 
infer. The galaxy may not be optically thin to far-infrared radiation; we assume the 
wavelength of optical depth unity is λ₀ = 200 µm. We apply corrections due to
CMB heating and the error induced by the large CMB background subtraction.
We show the SFR and dust mass inferred using our fiducial modified blackbody
model for various assumed dust temperatures, β_IR and λ₀ values in Extended Data
Fig. 5. 

The constraints induced by the observed correlation between the ultraviolet
spectral slope, β_UV, and the ratio of observed infrared to ultraviolet luminosity,
IRX, with the flux uncertainties and β_UV slope uncertainties included, are also
shown in Extended Data Fig. 5. The dust mass, M_D, for the fiducial model is
log(M_D/M⊙) = 7.6, with a 2σ lower bound of 7.30, and an upper bound that is not
strongly constrained, but where log(M_D/M⊙) = 7.9 implies a dust-to-gas ratio approximately
five times that of the Milky Way (see below), and is therefore disfavoured.
It has been shown recently that galaxies at high redshift may be preferentially more
dusty. This higher IRX, by 0.3 dex, is quite consistent with A1689-zD1.
Star-formation rate. We measure the ultraviolet flux F_ν from the Hubble Space
Telescope F160W photometric data point. The ultraviolet luminosity is calculated
for the best-fit redshift of 7.5 using our standard cosmological parameters. No
dust correction is applied in calculating the ultraviolet SFR. The SFR is derived
from the ultraviolet and total infrared (3–1,100 µm) luminosities, L_UV and
L_TIR. For the ultraviolet luminosity we find L_UV = (1.8 ± 0.2) × 10¹⁰ L⊙, corresponding
to a SFR = 2.7 ± 0.3 M⊙ yr⁻¹. For the total infrared luminosity we obtain
L_TIR = (6.2 ± 0.8) × 10¹⁰ L⊙, which is the flux uncertainty only, using our fiducial
model, but where the true uncertainty is constrained primarily by the desire to
remain consistent within the scatter with the IRX–β_UV relation. These values are
consistent with those derived from modelling the SED (see below) and result in
SFR_TIR = 9 ± 2 M⊙ yr⁻¹, using recent calibrations based on the 3–1,100 µm infrared
luminosity. A SFR_TIR much below 7 M⊙ yr⁻¹ is excluded in this model. A
substantially lower dust temperature could result in a higher SFR_TIR, but would
imply an excessive dust mass. From the analysis in Extended Data Fig. 5, our best
estimate is SFRyyp = 97$Mo yr’. 

The sum of the ultraviolet and infrared SFRs results in a total SFR of 

12¢3Mo yr, for this initial mass function (statistical uncertainty only), with a 
likely additional contribution to the luminosity and hence the SFR from the mid- 
infrared emission that is not well-modelled by a single temperature modified black- 
body (see SED modelling below). Therefore, we take this as a conservative estimate 
of the SFR and note that it may be somewhat higher. 
Gas mass and metallicity. We invert the Schmidt-Kennicutt law relating the surface
densities of SFR and gas mass to obtain constraints on the gas mass. The
dominant uncertainty on this conversion comes from the uncertainty in the slope
of the Schmidt-Kennicutt law, which is a ±0.4 dex scatter; the dependence on the
surface area of the galaxy is very small (0.04 dex) since it appears in both the SFR
and gas mass terms. We here use the relation as originally derived, which reproduces
even starburst galaxies quite well, assuming a constant carbon monoxide (CO) to
gas conversion, X(CO). Low-metallicity galaxies may have a lower gas mass for
a given SFR, which would only increase the dust-to-gas ratio derived here. The
derived gas mass is inversely dependent on the inferred lensing magnification.
Since A1689-zD1 is not very close to the critical lines, its lensing amplification is
likely to be fairly accurate. A different analysis of the data for the cluster gives a
similar magnification of 8.6 ± 0.5 for this source (M. Limousin, private communication),
confirming that the lens model contributes only a small additional uncertainty,
0.025 dex, for the calculations presented here. The overall uncertainty in
the gas mass summed in quadrature is therefore 0.45 dex, including the scatter in 
the Schmidt-Kennicutt relation and the uncertainty on the SFR. 

The dust-to-gas ratio we derive for our fiducial model is 0.017, with a total uncertainty
of 0.5 dex under the assumptions outlined above for the derivation of the
dust and gas masses. Both values depend on the SFR and this is accounted for in 
the uncertainty on the ratio. The scatter in the Schmidt-Kennicutt relation and the 
dust mass systematics dominate the uncertainty. 
The age, mass and SED of A1689-zD1. We construct the photometric SED of 
A1689-zD1 from the Hubble Space Telescope’s ACS camera F775W and F850W,
WFC3 camera F105W, F125W, F140W and F160W, and NIC2 camera F110W
bands, and the Spitzer Space Telescope’s IRAC camera 3.6 µm and 4.5 µm bands.
Since its first discovery with the Hubble Space Telescope, A1689-zD1 was observed
in April 2010 with the Hubble WFC3 in the F105W, F125W, F140W and F160W
bands, for 2.5 ks each (proposal ID 11802, Principal Investigator: H. Ford). The
individual reduced frames were combined using the MultiDrizzle software onto a
common pixel scale of 0.05″ per pixel and the astrometry was matched to the ACS
images. Isophotal magnitudes have been measured using SExtractor in double-image
mode, taking the F140W image as a reference for the detection. The fluxes
for the source from the newer Hubble imaging are: for F105W, 25.49 ± 0.18, for
F125W, 25.00 ± 0.13, for F140W, 24.64 ± 0.05, and for F160W, 24.51 ± 0.11. In
addition, photometry from ref. 10 is used. 

We build a large library of model galaxy SEDs by convolving Simple Stellar Population
models with randomly generated star-formation histories. These are modelled
with exponentially declining laws (parameterized by the formation redshift,
bound to be younger than the Universe’s age at z = 7.5, and the star-formation
timescale) with stochastic bursts of star formation (parameterized by their duration
and fraction of mass formed). The models span a metallicity range between
20% and 2.5 times solar. Dust attenuation is applied to the stellar SEDs adopting a
two-component model and randomly generating the values for the total optical
depth and the fraction of it contributed by the diffuse interstellar medium. Finally,
transmission in the intergalactic medium is applied.


Following the Bayesian approach developed and extensively applied to lower-redshift
data, the observed SED is compared to every model SED in the library.
This allows us to construct the full probability density function of parameters of
interest by weighting each model by its likelihood exp(−χ²/2) and marginalizing
over the nuisance parameters of the star-formation histories. For completeness we also
derive the total (dust-corrected and averaged over the galaxy age) SFR, 9 ± 3 M⊙ yr⁻¹,
and the dust attenuation, A_1600 = 1.0 ± 0.4 mag, consistent with the values derived
elsewhere in the paper. Because of the almost complete absorption of the flux below
the Lyα by the intergalactic medium, whether we include the ACS upper limits or
not does not change the fit. The best fit to the ultraviolet-optical SED is shown in
Extended Data Fig. 6. 
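
The likelihood weighting described above can be illustrated with a small sketch. It is not the authors' pipeline: the model library here is a hypothetical list of (parameter value, χ²) pairs, and `likelihood_weights` and `weighted_percentile` are illustrative names. Marginalizing over nuisance parameters amounts to accumulating the weights of all models against the one parameter of interest.

```python
import math


def likelihood_weights(chi2_values):
    """Normalized weights exp(-chi2/2) for each model in the library."""
    chi2_min = min(chi2_values)  # subtract the minimum for numerical stability
    raw = [math.exp(-(c - chi2_min) / 2.0) for c in chi2_values]
    total = sum(raw)
    return [w / total for w in raw]


def weighted_percentile(values, weights, q):
    """Smallest value at which the cumulative weight reaches the fraction q."""
    pairs = sorted(zip(values, weights))
    cumulative = 0.0
    for value, weight in pairs:
        cumulative += weight
        if cumulative >= q:
            return value
    return pairs[-1][0]


# Hypothetical library: each model contributes one value of the parameter of
# interest (here a log stellar mass) and its chi-squared against the photometry.
log_mass = [9.0, 9.2, 9.4, 9.6]
chi2 = [6.1, 2.3, 2.9, 9.8]
weights = likelihood_weights(chi2)
median = weighted_percentile(log_mass, weights, 0.5)
lower = weighted_percentile(log_mass, weights, 0.16)
upper = weighted_percentile(log_mass, weights, 0.84)
```

With a real library of many thousands of models, the weighted percentiles approximate the median and 68% interval of the marginal probability density function.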

Finally, strong emission lines are not included in the SED modelling. We consider
possible contamination by the Balmer Hγ and Hδ lines to the 3.6 µm flux and
the contamination by the Hβ and [O III] lines to the 4.5 µm flux. Adopting a SFR of
12 M⊙ yr⁻¹ and standard case B Balmer line ratios, we expect the 3.6 µm flux contribution
to be at most 7%. The nebular line contribution to the 4.5 µm flux could
amount to 8%–14% (depending on the [O III]/Hβ line ratio). By applying these
corrections to the observed fluxes, our SED fit would yield a stellar mass 0.06 dex
lower and a light-weighted mean age 0.04 dex younger than obtained above.

As a final check, we fit self-consistent full ultraviolet to far-infrared SED models
using the MAGPHYS” and GRASIL**” codes. From these fits we derived the fol- 
lowing lensing-corrected parameters: SFR = 1073Mo yr’, log(Mx/Mo) = 9.3707, 
Mp = 343 X 10’Mo (MAGPHYS); SER = 9734 Mo yr’, log(Ma/Mo) = 9.4705, 
My = 773 X 10’Mo (GRASIL). Applying the same order of CMB correction to 
these fits as to the modified blackbody would increase each parameter by about 30% 
Extended Data Figure 1 | Cumulative sum of the unbinned spectrum.
The VIS and NIR arms are plotted in blue and red respectively. The best-fitting
step function is plotted as a dashed line. The break in the spectrum is clearly
detected with the NIR arm only. Gaps in the cumulative spectrum are due to
removal of regions affected by strong sky absorption.
Extended Data Figure 2 | Spectrum obtained only at position angle 64° East
of North. The slit consistently covered both the emission from the high
redshift galaxy and the galaxy located 2″ below it. This spectrum uses
approximately half of the total exposure time. The upper panel shows the
two-dimensional rectified spectrum, the lower panel the one-dimensional
spectrum of the companion. Error bars are 68% confidence. The spectrum of
the companion galaxy is recovered through the entire spectral range, including
that covered by the transition from the VIS to the NIR data, and shows no
indication of the sharp break seen in A1689-zD1.
data. The probability distribution is based on fitting galaxies using the
New-HyperZ code.
centred on A1689-zD1; the image is primary-beam-corrected. The depth of
the map at the location of A1689-zD1 is 0.12 mJy per beam (42% of the deepest
part of the mosaic). The sensitivity decreases towards the edge of the mosaic.
The structure north of A1689-zD1 is a probable detection of a different source
in the field and will be presented in a forthcoming paper (K.K. et al., manuscript
in preparation).
Extended Data Figure 5 | Dust mass and SFR_TIR from modified blackbody
fits. Tracks show how the parameters change with temperature, with different
tracks for different opacity wavelengths, λ₀. Varying β_IR is shown for
λ₀ = 200 µm (black and grey lines). Intrinsic (CMB-corrected) and measured
temperatures are indicated for λ₀ = 300 µm (orange line) on the concave and
convex sides respectively. A diamond marks our fiducial model: uncorrected
T = 35 K, λ₀ = 200 µm, β_IR = 1.92. Solid-colour regions show <90%, <95%,
and <99% confidence intervals due to the β_UV–IRX relation (including
measurement uncertainties, β_IR = 1.72–2.12, and λ₀ < 300 µm), with solid,
dashed, and dot-dashed lines indicating these intervals for the tracks. Dotted
lines mark >99%.
Extended Data Figure 6 | Ultraviolet-optical SED for A1689-zD1. Stellar
synthesis models from ref. 43 (BC03) are fitted to the photometric data
(squares). Error bars are 68% confidence. The best-fitting model is shown in …
Very Large Telescope (VLT)/X-shooter spectrum is also plotted (solid histogram)
for comparison.
consistent with those derived from the modified blackbody (solid line) and
Highly efficient star formation in NGC 5253 possibly
from stream-fed accretion


J. L. Turner, S. C. Beck, D. J. Benford, S. M. Consiglio, P. T. P. Ho, A. Kovacs, D. S. Meier & J.-H. Zhao


Gas clouds in present-day galaxies are inefficient at forming stars. 
Low star-formation efficiency is a critical parameter in galaxy evolution:
it is why stars are still forming nearly 14 billion years after the
Big Bang and why star clusters generally do not survive their births,
instead dispersing to form galactic disks or bulges. Yet the existence
of ancient massive bound star clusters (globular clusters) in the Milky
Way suggests that efficiencies were higher when they formed ten billion
years ago. A local dwarf galaxy, NGC 5253, has a young star cluster
that provides an example of highly efficient star formation. Here
we report the detection of the J = 3→2 rotational transition of CO
at the location of the massive cluster. The gas cloud is hot, dense, qui- 
escent and extremely dusty. Its gas-to-dust ratio is lower than the 
Galactic value, which we attribute to dust enrichment by the embedded 
star cluster. Its star-formation efficiency exceeds 50 per cent, tenfold 
that of clouds in the Milky Way. We suggest that high efficiency results 
from the force-feeding of star formation by a streamer of gas falling 
into the galaxy. 

The Submillimeter Array image of NGC 5253, shown in Fig. 1, reveals
a bright CO(3→2) source coincident with the giant cluster and its
‘supernebula’. ‘Cloud D’ (ref. 3) is one of only two molecular clouds
detected within the galaxy; the second cloud is smaller and located ~5″
(90 pc) to the southwest. A ‘streamer’ of gas extending along the minor axis
is also detected in CO(3→2). This streamer, previously detected in lower-J
CO lines, seems to be falling into the galaxy near the supernebula. Both
the streamer and Cloud D emit 870-µm continuum emission, as shown
in Fig. 2. Also shown is an image of 350-µm continuum, in which both
Cloud D and the streamer are detected.

The molecular gas in Cloud D is hot. This is clear from the increase
in brightness from CO(2→1) (ref. 3) to CO(3→2). The intensity ratio
of the two lines is I_32/I_21 = 2.6 ± 0.5 (I_line = ∫T_line dv). This ratio is non-thermal,
although the thermal limit of 2.25 (~ν²) is within the uncertainties
and is what we adopt. Non-local-thermodynamic-equilibrium
(non-LTE) modelling of this ratio using RADEX indicates a minimum
kinetic temperature of T_K > 200 K for the 1σ lower limit, and
T_K > 350 K for the adopted value of I_32/I_21 = 2.25 (see Methods). The
high gas temperature is consistent with a thermal origin for H₂ 2.2-µm
emission in the region. Cloud D seems to be a photon-dominated
region, heated by ultraviolet radiation from the several thousand
O stars in the cluster. The CO(3→2)-emitting gas is dense, with
n_H2 ~ (4.5 ± 0.5) × 10⁴ cm⁻³.

By contrast with Cloud D, the streamer consists of more typical cool
giant molecular clouds. Its value of I_32/I_21 = 1.0 ± 0.3 is consistent with
optically thick emission, for which RADEX models allow temperatures
as low as T_K ~ 15–20 K, and number densities n_H2 ~ (3.5–4) × 10⁴ cm⁻³.
The mass of the streamer is M_H2 ~ 2 × 10⁶ M⊙, which is 1–2% of the
stellar mass of the galaxy. The streamer is molecular, dense, and primed
for star formation, even before entering the galaxy. This is unlikely to
be a primordial collapsing filament, but is more probably previously
enriched gas.


The star-formation efficiency of a cloud or region can be defined as
η = M_stars/(M_gas + M_stars), where M_stars is the stellar mass and M_gas the
mass of molecular gas. M_gas can be hard to define for star-forming regions
within giant molecular clouds, but the association of the isolated Cloud D
with the supernebula gives us an opportunity to calculate η directly for
the giant molecular cloud giving birth to this massive star cluster, if we
can determine the mass of Cloud D.

CO is often used to estimate the mass of molecular gas, but this is 
unreliable in NGC 5253. Here we use the width of the CO line to determine
a gas mass for Cloud D based on dynamical considerations. The
CO linewidth is σ = 9.2 ± 0.6 km s⁻¹, based on a Gaussian fit. The cloud
dimensions, deconvolved from the beam, are 2.8″ × 1.5″ ± 0.1″ (52 pc
X 28 pc). The virial mass is My;. < 1.818? X 10°Mj for Cloud D, with 
uncertainties due to the unknown internal mass distribution (see Methods). 


Figure 1 | CO J = 3→2 emission in NGC 5253. The Submillimeter Array
(SMA) CO(3→2) integrated line intensity, in red, is shown atop a λ814 nm
Hubble Space Telescope image. The SMA beam is 4″ × 2″ (74 pc × 37 pc).
The field covers 40″ × 40″ (740 pc × 740 pc), north up, east left. Image
registration is to less than 1″. The CO streamer coincides with the optical dust
lane to the east. The massive star cluster is located at the bright, compact CO
peak, Cloud D; it is embedded and is not visible here. Cloud F is to the
southwest of Cloud D.
Figure 2 | Dust and gas in NGC 5253. SMA image of continuum dust
emission at 870 µm (greyscale), with 350-µm dust continuum emission from
SHARC at the Caltech Submillimeter Observatory (contours) superimposed.
The SMA continuum image has been smoothed to 6″ resolution to show
emission from the streamer. The SHARC image has been smoothed to 12.7″;
contours are 2σ. Coordinates of the SHARC image are uncertain to ~5″
(see Methods).


We assume that the linewidth is gravitational, with no winds or outflow;
this is therefore an upper limit to M_vir. The virial mass includes both
gas and stars, but we can constrain the stellar mass. The mass in stars
exciting the supernebula, M_stars, can be predicted from the Lyman continuum
rate of N_Lyc = (7 ± 2) × 10⁵² s⁻¹ (ref. 6) and Brackett γ equivalent
width of 255 Å (ref. 10). The star cluster has mass M_stars ≈ 1.1
× 10⁶ M⊙ (see Methods). We then obtain a gas mass M_gas = M_vir −
M_stars = (7 ± 4) × 10⁵ M⊙ if the cluster is embedded in the cloud. This
treatment assumes that the CO kinematics trace all of the cloud and
that there is no extensive layer of H₂ without CO; however, the dust
continuum and CO sizes are nearly identical, consistent with the dust
and gas mass being contained within Cloud D.

Other methods to estimate H₂ mass are problematic for Cloud D.
CO(3→2) is optically thin, but the excitation temperature is not determined,
nor is CO/H₂ known. The line strength of 41 ± 8 Jy km s⁻¹
gives M_CO = (4 ± 1) M⊙ (T/200 K). For a Galactic abundance ratio of
[CO]/[H>] = 8.5 X 10°, the H, mass would be My, = 5 X 10°Me 
(T/200 K). This is one-tenth of the value for M_gas derived above. Intense
radiation fields and high temperature will affect the chemistry and the 
relative abundance of CO. 

Dust continuum emission can trace gas mass, but it is also unreliable
in Cloud D. The 870-µm continuum flux density for Cloud D is
S_870µm = 72 ± 10 mJy, consistent with previous measurements, of which
free–free emission contributes 38 ± 4 mJy, leaving dust emission
of 34 ± 14 mJy. Our 350-µm image gives S_350µm = 1.0 ± 0.2 Jy
for Cloud D, consistent with the 870-µm flux. Adopting the dust opacity
of the Large Magellanic Cloud (see Methods), we find an observed
dust mass, M_dust = (1.5 ± 0.2) × 10⁴ M⊙ (T/45 K)⁻¹. To obtain gas mass,
we need a gas-to-dust ratio (GTD). Scaled as the oxygen abundance of
NGC 5253, 0.2–0.3 solar, GTD ~ 650, within the range 340–1,200
inferred for the Magellanic clouds. The observed dust mass is fivefold
the ~3,000 M⊙ of dust expected for a 2 × 10⁶ M⊙ gas cloud at
GTD = 650. If instead we compare the observed dust mass with the
dynamical estimate of gas mass, we derive GTD ~ 47 for the embedded
cluster, and in the unlikely case that the cluster is not embedded,
GTD = 120. Dust is an expected result of mass loss from massive, short-lived
stars. Stellar models indicate that a cluster of massive stars of age
4.4 Myr, consistent with recombination line equivalent widths, will expel
20,000–30,000 M⊙ of the elements carbon, oxygen, silicon, magnesium and
iron, depending on the cluster mass and initial mass function, of which
~30–50% will be in the form of dust (see Methods). Producing the
amounts of dust and ionizing photons observed, given the upper limit
imposed by the dynamical mass, suggests that the stellar initial mass
function is top-heavy, with a lower mass cutoff of at least 2–3 M⊙ (see
Methods). The star cluster has probably produced most of the dust. To
infer a gas mass on the basis of the observed dust emission for Cloud D
from a GTD scaled to the global metallicity of NGC 5253 without accounting
for in situ dust production, as has been done for other galaxies,
would give an erroneously high gas mass and underestimate the star-formation
efficiency.
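
The gas-to-dust figures in this paragraph are simple ratios of quantities quoted in the text, and can be checked directly. The snippet below uses central values only and ignores the quoted uncertainties; the variable names are illustrative.

```python
# Central values quoted in the text (solar masses); uncertainties are ignored here.
M_DUST_OBSERVED = 1.5e4   # observed dust mass at T = 45 K
M_GAS_EMBEDDED = 7e5      # dynamical gas mass if the cluster is embedded
M_VIR = 1.8e6             # virial mass, the gas mass if the cluster is not embedded
GTD_FROM_METALLICITY = 650.0
M_GAS_REFERENCE = 2e6     # reference gas cloud used for the expected dust mass

gtd_embedded = M_GAS_EMBEDDED / M_DUST_OBSERVED
gtd_not_embedded = M_VIR / M_DUST_OBSERVED
expected_dust = M_GAS_REFERENCE / GTD_FROM_METALLICITY
dust_excess = M_DUST_OBSERVED / expected_dust

print(f"GTD (embedded cluster):     {gtd_embedded:.0f}")
print(f"GTD (cluster not embedded): {gtd_not_embedded:.0f}")
print(f"Expected dust at GTD = 650: {expected_dust:.0f} Msun")
print(f"Observed / expected dust:   {dust_excess:.1f}")
```

The ratios reproduce GTD ~ 47 and GTD = 120, an expected dust mass of about 3,000 M⊙, and an observed dust mass roughly five times larger than that.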

Given the peculiarities of Cloud D, the most reliable gas mass is dynamical.
We use this mass to calculate the star-formation efficiency, η. A
lower limit to η occurs if the gravitational mass is all gas, with no stars,
so that η ≈ 1.1 × 10⁶ M⊙/[(1.1 + 1.8) × 10⁶ M⊙] ≈ 38%.
Even in this case, η significantly exceeds the η < 1% of Galactic giant
molecular clouds and the highest efficiencies of η ~ 15–20% seen in
individual cloud cores in the Galaxy. However, it is almost certain
that the star cluster is located within the cloud, given the subarcsecond
positional coincidence of nebular emission and CO, the precise kinematic
coincidence of nebular H53α (ref. 8) and CO line centroids, and
the high extinction to the cluster. Thus the stellar mass also contributes
to the linewidth. For the more realistic case that the cluster is embedded
within Cloud D, η ≈ 1.1 × 10⁶ M⊙/(1.8 × 10⁶ M⊙) ≈ 61%.
This value exceeds even the canonical ~50% (see Methods) needed to
allow a star cluster to survive in its current bound state with rapid gas
dispersal. If dust competes with gas for ultraviolet photons, the Lyman
continuum rate and stellar mass have been underestimated, and η is
even higher. If there are winds or outflows contributing to the CO linewidth,
η is higher. The large dust mass favours larger-mass cluster models
for which η is higher. These values for η are uncertain, but they are free
of the systematics due to standard assumptions such as gas-to-dust ratio,
relative CO abundance or CO conversion factor. The star-formation
efficiency of Cloud D is unusually high, implying gas consumption
timescales of ~10 Myr.
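
The two efficiency cases can be reproduced from the definition η = M_stars/(M_gas + M_stars) given earlier. The sketch below uses the central values of the stellar and virial masses only and does not propagate their uncertainties; `star_formation_efficiency` is an illustrative name.

```python
def star_formation_efficiency(m_stars, m_gas):
    """eta = M_stars / (M_gas + M_stars)."""
    return m_stars / (m_gas + m_stars)


# Central values from the text, in solar masses.
M_STARS = 1.1e6
M_VIR = 1.8e6

# Lower limit: the whole dynamical mass is gas and the stars lie outside it.
eta_lower = star_formation_efficiency(M_STARS, M_VIR)

# Embedded cluster: the dynamical mass is gas plus stars, so M_gas = M_vir - M_stars.
eta_embedded = star_formation_efficiency(M_STARS, M_VIR - M_STARS)

print(f"eta (lower limit): {100 * eta_lower:.0f}%")
print(f"eta (embedded):    {100 * eta_embedded:.0f}%")
```

In the embedded case the denominator reduces to the virial mass, which is why that efficiency is simply M_stars/M_vir.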

A measurement of star-formation efficiency is a snapshot in time; η
could be high because the gas has been incorporated into stars or because
the young stars have already dispersed the gas. How and when a young
star cluster disperses its gas is crucial to its survival. For Cloud D, gas
dispersal models are strongly constrained by the youth of the embedded
star cluster, its positional coincidence with the cloud, lack of evidence
for supernovae, and small CO(3→2) linewidth. Apparently not much
has yet escaped this cloud. 

Cloud D is a strange molecular cloud: hot, dusty, and small in mass 
relative to its young star cluster. It is found in a dark-matter-dominated 
galaxy. Its unusual properties may indicate a mode of star formation 
different from that observed in disk galaxies, including luminous infrared
galaxies. Models of stochastic star formation for turbulently supported
giant molecular clouds in our Galaxy suggest that star-formation
efficiencies are 1% in a free-fall time, which implies that the ultimate
efficiency can be limited if star formation is quenched by massive stellar
feedback. An extended period of star formation might be facilitated
if Cloud D is compressed by an external influence, as for example by
a streamer of gas force-fed into the star-forming region by the galactic
potential. Our data in NGC 5253 could support such a model. The
streamer contains ~2 × 10⁶ M⊙ of gas extending ~200–300 pc along
the minor axis, entering the galaxy at a rate of ~20 pc Myr⁻¹. The
streamer can fuel star formation at the present rate of ~0.1–0.2 M⊙ yr⁻¹
for the next 10 Myr. This dwarf spheroidal galaxy is not rotationally
supported; multiple accreting streams from its extensive H I halo
could be responsible for its global dynamics and morphology as well
as its spheroidal system of massive star clusters spanning billions of
years in age. NGC 5253 may illustrate a new mode of highly efficient
star cluster formation triggered by cold-stream accretion.
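
The streamer fuelling estimate can be checked with order-of-magnitude arithmetic. The sketch assumes, as a simplification not spelled out in the text, that the whole streamer mass is delivered at a constant rate over the time it takes its full length to enter the galaxy.

```python
# Quantities quoted in the text.
M_STREAMER = 2e6                 # gas mass in solar masses
LENGTH_RANGE_PC = (200.0, 300.0)
INFALL_SPEED_PC_PER_MYR = 20.0

# Time for the full streamer length to enter the galaxy, in Myr.
infall_times_myr = [length / INFALL_SPEED_PC_PER_MYR for length in LENGTH_RANGE_PC]

# Mean mass supply rate in solar masses per year (1 Myr = 1e6 yr).
supply_rates = [M_STREAMER / (t * 1e6) for t in infall_times_myr]

for t, rate in zip(infall_times_myr, supply_rates):
    print(f"infall time {t:.0f} Myr -> supply rate {rate:.2f} Msun/yr")
```

Infall times of 10 to 15 Myr give supply rates of roughly 0.13 to 0.2 M⊙ yr⁻¹, in line with the ~0.1–0.2 M⊙ yr⁻¹ sustained for about 10 Myr quoted above.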
METHODS

Submillimeter Array observations. NGC 5253 was observed with the Submillimeter
Array (SMA) on 2011 April 17. The observing frequency was ν_LO = 340.323 GHz
with 48 adjacent spectral windows covering 4 GHz bandwidth for each of two
sidebands. The CO J = 3→2 rotational transition at ν₀ = 345.79599 GHz was in
the upper sideband. The array was in the subcompact configuration covering the
visibility baselines between 9 and 80 kλ corresponding to the angular scales between
29″ and 2″. The phase centre was α_J2000 = 13 h 39 min 56.249 s, δ_J2000 = −31° 38′ 29.00″.
Calibration and reduction were performed with MIRIAD. The instrumental bandpass
was corrected using the quasar 3C 279; complex gains were calibrated using
the nearby quasar J1316−336; the flux density scale was determined from the planet
model of Neptune. Continuum and line emission were separated using the task
UVLIN by fitting a linear model to line-free channels. The result is a cube of 25
channels of 10 km s⁻¹ and a continuum map with an effective bandwidth of 8 GHz,
convolved to a beam 4″ × 2″, position angle = 0°, shown in Extended Data Fig. 1.
Final noise levels were 3 mJy per beam in the continuum map, and 50 mJy per beam
in the individual 10 km s⁻¹ channels.

CSO SHARC observations. The 350-μm continuum observations were made with 
the SHARC camera at the Caltech Submillimeter Observatory on 1999 January 
11-12, with 225-GHz opacities around 0.035 and 0.075 for the respective dates. 
The data consist of 2.2 h of on-the-fly mapping with a ~60″ chopping secondary 
at 4.132 Hz, and were reduced with CRUSH, using an enhanced implementation 
of the Emerson II deconvolution algorithm, which uses sky rotation to fill in the 
poorly sampled spatial frequencies of the dual-beam chop. CRUSH removes 
direct-current detector offset and correlated sky-noise residuals; flatfields 
detectors based on sky response; and performs noise weighting, whitening and 
despiking. The main beam is 9″ full width at half-maximum at 350 μm, but the 
image presented here was smoothed to 12.7″ resolution. From observations of Mars 
taken immediately before the SHARC observations, at a similar elevation, it is 
estimated that the pointing is good to ~5″ root mean squared (r.m.s.). The 
systematic aperture flux calibration of the 350-μm image is estimated to be good 
to within 7% r.m.s. 

Relation of NGC 5253 to M83 and distance. NGC 5253 is a dwarf spheroidal 
galaxy of the Cen A/M83 galaxy complex*, with a stellar mass of ~1.5 X 10°M5 
(ref. 37) and an estimated total mass, including dark matter, about tenfold higher. 
It is close to the large spiral galaxy M83 in projection. However, the distance to M83, 
4.8 Mpc (ref. 39), is significantly larger than the distance to NGC 5253, at 3.8 Mpc 
(ref. 40). The H I streamer system in the halo of NGC 5253, from which the CO 
streamer seems to emanate, strongly suggests that this dwarf galaxy has had some 
encounter in its past, but M83 does not seem to be responsible. 

Cloud D CO emission. The J = 3 level of CO corresponds to an energy E_u/k of 
33 K, a temperature that begins to distinguish actively star-forming clumps from 
giant molecular clouds. Cloud D is bright in CO(3–2), but only weakly detected in 
CO(2–1) (ref. 3), and not at all in CO(1–0) (ref. 5). The total flux of CO(3–2) 
emission in the galaxy and streamer is 110 ± 20 Jy km s⁻¹, about 30% less than the 
single-dish flux. This is a typical value for local galaxies, because the array 
configuration is insensitive to structures ≳30″ in extent; the value is consistent 
with the extended streamer emission and with the JCMT-SCUBA continuum image. The 
CO(2–1) image is shown in Extended Data Fig. 2, overlaid on the SMA CO(3–2) 
image. CO(3–2) was not detected in previous SMA observations because of 
insufficient signal-to-noise. Located at α_J2000 = 13 h 39 min 55.943 s ± 0.003 s, 
δ_J2000 = −31° 38′ 25.097″ ± 0.05″, the Cloud D CO(3–2) source is coincident to 
within ±0.5″ with the core of the supernebula as defined by high-brightness 7-mm 
free-free emission. The CO(3–2) line centre is at a heliocentric velocity of 
397.5 ± 0.6 km s⁻¹ and the CO flux of Cloud D is 41 Jy km s⁻¹. The size of the CO 
source in the integrated intensity map deconvolved from the beam is 
2.8″ × 1.5″ ± 7%, position angle 12° ± 1°. The slight northward extension is 
consistent with features seen in free-free emission, but it is also in the same 
direction as the elongation of the beams for northern synthesis arrays for this 
source. 

Cloud D virial mass. The width of the CO(3–2) line is σ = 9.2 ± 0.6 km s⁻¹, based 
on a least-squares fit to a Gaussian line profile. We adopt a value for the radius of 
half the full width at half-maximum of the geometric mean of the deconvolved 
source size, using M_vir = α v² r M☉, where v = 2.35σ (in km s⁻¹) and r is in 
parsecs, with coefficients of α = 190 for ρ ~ r⁻¹, adopted here, and α = 126 for 
ρ ~ r⁻² and 210 for constant ρ giving the uncertainty limits in M_vir (ref. 45). 
We assume that the cloud is turbulently supported against gravity and 
dispersion-dominated, as for Galactic giant molecular clouds; inclination effects 
should therefore not be important. If Cloud D is not bound, or has flows that are 
super-gravitational, our estimate for the virial mass is an overestimate. 
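
The virial estimate can be reproduced approximately from the numbers quoted in this section. The following is a minimal sketch, not the authors' code: it assumes the relation M_vir = α v² r with v = 2.35σ in km s⁻¹ and r in parsecs, the deconvolved source size of 2.8″ × 1.5″, and the adopted distance of 3.8 Mpc; the function name and constants are illustrative.

```python
import math

ARCSEC_PER_RAD = 206264.806  # arcseconds in one radian


def virial_mass_msun(sigma_kms, major_arcsec, minor_arcsec, distance_mpc, alpha=190.0):
    """Virial mass in solar masses from M = alpha * v^2 * r.

    v is the line FWHM (2.35 * sigma) in km/s; r is half the geometric-mean
    FWHM source size, converted to parsecs at the given distance.
    """
    fwhm_kms = 2.35 * sigma_kms
    mean_size_arcsec = math.sqrt(major_arcsec * minor_arcsec)
    radius_pc = 0.5 * mean_size_arcsec / ARCSEC_PER_RAD * distance_mpc * 1e6
    return alpha * fwhm_kms ** 2 * radius_pc


# Cloud D inputs quoted in the text; alpha spans the three density profiles.
masses = {a: virial_mass_msun(9.2, 2.8, 1.5, 3.8, alpha=a) for a in (126.0, 190.0, 210.0)}
for a, m in masses.items():
    print(f"alpha = {a:.0f}: M_vir ~ {m:.2e} Msun")
```

With α = 190 this gives about 1.7 × 10⁶ M☉, close to the 1.8 × 10⁶ M☉ adopted elsewhere in Methods; the small difference presumably reflects rounding of the inputs.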

Supernebula stellar mass. The stellar mass is based on STARBURST99 (refs 48, 
49) modelling with the following constraints. First, the 7-mm flux density of the 
supernebula is 47 ± 4 mJy for the central 2″ (refs 8, 43). From this we obtain a 
Lyman continuum rate of N_Lyc = 7 × 10⁵² s⁻¹ for a nebula at 12,000 K (refs 50, 51). 
Second, the cluster age must be consistent with the Brackett γ equivalent width of 
255 Å (refs 10, 52) and mid-infrared ionic line ratios. Third, the cluster must be 
old enough to have Wolf-Rayet (WR) stars, which would explain the WR spectral 
signatures. Fourth, the cluster must have mass less than the virial mass 
of 1.8 × 10⁶ M☉, which is also the maximum mass allowed by the [S IV] and 
Brackett α linewidths. We considered both Geneva high and Geneva v = 40% breakup 
stellar models; the high-velocity models permit older cluster ages. The main 
parameter to vary in fitting the models is the lower mass cutoff to the initial 
mass function (IMF). We consider cluster models with standard Kroupa IMFs with 
upper mass cutoffs of 150 M☉, and top-heavy IMFs. Kroupa IMFs with stellar masses 
down to 0.1 M☉ cannot give cluster ages sufficiently old to have both WR stars and 
the given N_Lyc in view of the upper limit on the cluster mass. The IMF must be 
top-heavy: a cluster of M_stars ≈ 1.1 × 10⁶ M☉ requires a lower mass cutoff of 
>3 M☉. This cluster mass is consistent with previous estimates. The Lyman 
continuum rate inferred from free-free emission may be less than the true value 
because of leakage from the H II region. In addition, dust can absorb as much as 
50% of the ionizing photons in dense Galactic H II regions; this could also 
increase the stellar mass. Studies of the extended ionized gas in NGC 5253 
(ref. 61) indicate a total galactic star formation rate of twice what we calculate 
for the supernebula, but this could be due to ultraviolet photons from nearby 
slightly older clusters. 

Cloud D continuum and dust mass. The strong 870-μm continuum source towards 
Cloud D consists of equal parts free-free emission from the H II region and dust 
emission. The continuum source is located at α_J2000 = 13 h 39 min 55.948 s ± 0.005 s, 
δ_J2000 = −31° 38′ 24.88″ ± 0.11″. The 870-μm peak is 0.5″ north and 0.32″ 
west of the 7-mm continuum supernebula core, which is within the uncertainties 
of our 4″ × 2″ beam. The continuum source agrees in position and size with the CO 
source. The total 870-μm flux density of Cloud D is 72 ± 10 mJy. This flux is 
consistent with previous observations, and constitutes ~40% of the total 870-μm 
flux of 192 mJy for the galaxy as determined from a JCMT/SCUBA map. We 
extrapolate a free-free flux density from the 7-mm value of 47 ± 4 mJy 
(refs 8, 43), using the S_ν ∝ ν^−0.1 spectrum of optically thin emission, giving 
S_ff(870 μm) = 38 ± 4 mJy for Cloud D. The dust emission is then 
S_dust(870 μm) = S(870 μm) − S_ff(870 μm) = 34 ± 14 mJy. This 
value is just consistent with the upper limit set at 1.3 mm (ref. 3). The SHARC flux 
is S(350 μm) = 3.7 ± 0.5 Jy, of which ~1.0 ± 0.2 Jy originates in Cloud D. 
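
The free-free extrapolation and the resulting dust flux follow directly from the power law. The sketch below assumes an optically thin spectrum S_ν ∝ ν^−0.1 between 7 mm and 870 μm and uses the flux densities quoted above; the helper name is hypothetical.

```python
C_UM_GHZ = 299792.458  # speed of light in um * GHz, so nu[GHz] = C / lambda[um]


def freefree_flux_mjy(s_ref_mjy, lam_ref_um, lam_um, index=-0.1):
    """Scale a free-free flux density between wavelengths with S ~ nu**index."""
    nu_ref = C_UM_GHZ / lam_ref_um
    nu = C_UM_GHZ / lam_um
    return s_ref_mjy * (nu / nu_ref) ** index


s_total, s_total_err = 72.0, 10.0      # total 870-um flux of Cloud D (mJy)
s_7mm, s_7mm_err = 47.0, 4.0           # 7-mm free-free flux (mJy)

s_ff = freefree_flux_mjy(s_7mm, 7000.0, 870.0)
s_dust = s_total - s_ff
s_dust_err = s_total_err + s_7mm_err   # simple sum, which matches the quoted +/-14 mJy

print(f"free-free at 870 um: {s_ff:.1f} mJy")
print(f"dust at 870 um: {s_dust:.1f} +/- {s_dust_err:.0f} mJy")
```

This reproduces the ~38 mJy free-free and ~34 mJy dust contributions; adding the two uncertainties linearly is an assumption made here because it matches the quoted ±14 mJy.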

The dust mass is dependent on the submillimetre dust opacity and dust 
temperature. We adopt the opacity of the Large Magellanic Cloud, extrapolating 
from κ(160 μm) = 16 cm² g⁻¹ and β = 1.7 to obtain κ(870 μm) = 0.9 cm² g⁻¹. The 
350-μm and 870-μm fluxes for Cloud D are consistent with β = 1.7. For dust 
temperature we adopt T_dust = 45 K based on IRAS Point Source Catalog fluxes; 
T_dust < 57 K based on IRAS 60-μm flux. These values of temperature and opacity give 
M_dust = (1.5 ± 0.1) × 10⁴ M☉ (T/45 K)⁻¹ for the dust mass of Cloud D, with the 
uncertainty based on the flux. Previous determinations of the dust mass in NGC 5253 
were for the entire galaxy, including the streamer, based on the large-aperture 
JCMT/SCUBA flux density; if scaled to our flux of 34 mJy for Cloud D only, these 
models would give a dust mass of M_dust ~ (2–3) × 10⁴ M☉, consistent with a 
cooler dust temperature. 
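
The opacity extrapolation and the dust mass can be sketched as follows. This is an illustration under stated assumptions, not the authors' procedure: it assumes κ ∝ λ^−β, optically thin dust emission, and the Rayleigh-Jeans form of the Planck function (suggested by the (T/45 K)⁻¹ scaling above); the constants and function names are chosen here for the example.

```python
K_B = 1.380649e-16      # Boltzmann constant, erg/K
C_CM = 2.99792458e10    # speed of light, cm/s
MPC_CM = 3.0857e24      # centimetres per megaparsec
MSUN_G = 1.989e33       # grams per solar mass


def kappa_scaled(kappa_ref, lam_ref_um, lam_um, beta):
    """Extrapolate a dust opacity (cm^2/g) assuming kappa ~ lambda**(-beta)."""
    return kappa_ref * (lam_ref_um / lam_um) ** beta


def dust_mass_rj_msun(flux_mjy, lam_um, temp_k, kappa_cm2_g, distance_mpc):
    """Optically thin dust mass, M = S D^2 / (kappa B_nu), with Rayleigh-Jeans B_nu."""
    nu_hz = C_CM / (lam_um * 1e-4)
    b_nu = 2.0 * K_B * temp_k * nu_hz ** 2 / C_CM ** 2   # erg s^-1 cm^-2 Hz^-1 sr^-1
    flux_cgs = flux_mjy * 1e-26                          # mJy -> erg s^-1 cm^-2 Hz^-1
    d_cm = distance_mpc * MPC_CM
    return flux_cgs * d_cm ** 2 / (kappa_cm2_g * b_nu) / MSUN_G


kappa_870 = kappa_scaled(16.0, 160.0, 870.0, beta=1.7)
m_dust = dust_mass_rj_msun(34.0, 870.0, 45.0, kappa_870, 3.8)

print(f"kappa(870 um) ~ {kappa_870:.2f} cm^2/g")
print(f"M_dust ~ {m_dust:.2e} Msun")
```

Under these assumptions the opacity comes out at ~0.9 cm² g⁻¹ and the dust mass at about 1.6 × 10⁴ M☉, within roughly 10% of the quoted value; using the full Planck function instead would give a somewhat larger mass.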

Dust yield from the cluster and GTD. The GTD estimated from our dust mass 
of 1.5 × 10⁴ M☉ and our gas mass derived from virial and stellar masses is ~47 
(embedded cluster) or 120 (cluster outside cloud). Either is significantly lower than 
the value of 650 predicted from scaling the Galactic value of 160 (ref. 65) to the 
metallicity of NGC 5253. We argue that the high dust mass is from in situ 
enrichment by the cluster. The Brackett γ equivalent width of 255 Å (refs 10, 52) 
indicates a cluster age consistent with the presence of WR activity, so mass loss 
is expected. If the original progenitor cloud had a mass of ~2 × 10⁶ M☉, there 
would initially have been ~3,000 M☉ of dust in Cloud D, one-fifth of the amount 
that we observe. From STARBURST99 models with Z = 0.004, Geneva high-mass-loss 
stellar models, for a 1.1 × 10⁶ M☉ cluster of age 4.4 Myr with an initial mass 
function of range 3–150 M☉ and log N_Lyc = 52.84, one would expect a yield in 
elements C, N, O, Mg, Si and Fe of ~24,000 M☉, of which an estimated 30–50% would 
be in the form of dust. If the cluster is more massive or more top-heavy, the mass 
loss can be higher. The Geneva rotating models give yields that are lower but still 
sufficient to explain the dust mass. The optical spectrum of NGC 5253 does not yet 
reflect this localized enrichment: the metallicity of ~0.25 solar is based on 
nebular lines from the extended nuclear H II region, whereas the embedded star 
cluster is behind at least 16 magnitudes of extinction. However, the small CO 
linewidth suggests that what has been produced in the cloud has so far largely 
stayed in the cloud. 
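
The gas-to-dust (GTD) figures in this paragraph are simple ratios of the masses quoted in Methods. The sketch below restates that arithmetic; it assumes a virial mass of 1.8 × 10⁶ M☉, a stellar mass of 1.1 × 10⁶ M☉ and a dust mass of 1.5 × 10⁴ M☉, and the variable names are illustrative.

```python
M_VIR = 1.8e6        # virial mass of Cloud D (Msun)
M_STARS = 1.1e6      # cluster stellar mass (Msun)
M_DUST = 1.5e4       # dust mass of Cloud D (Msun)
GTD_PREDICTED = 650  # value scaled from the Galactic ratio to this metallicity

# If the cluster sits inside the cloud, the stars count against the virial mass.
gtd_embedded = (M_VIR - M_STARS) / M_DUST
# If the cluster lies outside the cloud, all of the virial mass is gas.
gtd_outside = M_VIR / M_DUST

# Dust expected in a ~2e6 Msun progenitor cloud at the predicted ratio.
initial_dust = 2e6 / GTD_PREDICTED
initial_fraction = initial_dust / M_DUST

# Dust expected from the modelled heavy-element yield at 30-50% condensation.
yield_dust = [f * 24000.0 for f in (0.3, 0.5)]

print(f"GTD (embedded) ~ {gtd_embedded:.0f}, GTD (outside) ~ {gtd_outside:.0f}")
print(f"initial dust ~ {initial_dust:.0f} Msun ({initial_fraction:.2f} of observed)")
print(f"dust from cluster yield ~ {yield_dust[0]:.0f}-{yield_dust[1]:.0f} Msun")
```

The yield-based estimate of roughly 7,000–12,000 M☉ of dust is of the same order as the observed dust mass, which is the basis for the in situ enrichment argument.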

Cloud D excitation modelling. The CO(3–2) to CO(2–1) line ratio can be used 
to constrain gas density and temperature. Extended Data Fig. 2 demonstrates clear 
differences between the emission in Cloud D and the streamer. RADEX was used 
to perform non-LTE modelling of the line ratios. We assumed a black-body 
radiation field and a column density of 10'° cm⁻², consistent with the observed 
brightness. Allowed values of density and kinetic temperature for the adopted value 
of I₃₂/I₂₁ = 2.25 and the 1σ lower limit of 2.0 are shown in Extended Data Fig. 3. 
For the value of 2.25, the lower limit on the gas kinetic temperature is 
T_K ~ 350 K; however, the ratio of 2.1 is within the uncertainties and would give 
T_K > 200 K. The indication of thermal ratios in the near-infrared H₂ line ratios 
would also support a high temperature for this cloud. 
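
The origin of the 2.25 optically thin limit can be illustrated with a much simpler calculation than the RADEX modelling. The sketch below is not the non-LTE calculation used here: it assumes optically thin emission in LTE, neglects the background radiation field and Rayleigh-Jeans corrections, and uses the standard CO upper-level energies E_u/k of 16.60 K (J = 2) and 33.19 K (J = 3). In that limit the integrated brightness-temperature ratio is (3/2)² exp[−(E₃ − E₂)/kT].

```python
import math

E_UP_K = {2: 16.60, 3: 33.19}  # CO upper-level energies E_u/k in kelvin


def thin_lte_ratio_32_21(t_ex_k):
    """Optically thin LTE ratio I32/I21 of integrated brightness temperatures."""
    return (3 ** 2 / 2 ** 2) * math.exp(-(E_UP_K[3] - E_UP_K[2]) / t_ex_k)


def temperature_for_ratio(ratio):
    """Excitation temperature giving the ratio; infinite if no solution exists."""
    limit = 3 ** 2 / 2 ** 2
    if ratio >= limit:
        return math.inf
    return (E_UP_K[3] - E_UP_K[2]) / math.log(limit / ratio)


for r in (2.0, 2.1, 2.6):
    print(f"I32/I21 = {r}: T_ex ~ {temperature_for_ratio(r):.0f} K")
```

In this simplified picture the ratio approaches 2.25 only at high temperature, a ratio of 2.1 corresponds to a few hundred kelvin, and a ratio of 2.6 has no solution, which is qualitatively the behaviour described for Cloud D.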

Mass of Cloud D based on X_CO. X_CO masses are based on CO(1–0) emission, 
which has not been detected in Cloud D. Given that the CO emission is optically 
thin, it seems that the X_CO value would underpredict the H₂ mass by a factor of ~8. 

Streamer kinematics. The streamer has been detected previously in CO(1–0) 
and CO(2–1) (refs 3, 5). The emission is found at heliocentric velocities of 
410–430 km s⁻¹, which is red-shifted by about 20 km s⁻¹ with respect to the galaxy 
and the supernebula Cloud D. High-resolution VLA images also detect H I emission 
coincident with this streamer. The streamer coincides with filamentary emission 
in nebular lines of oxygen and sulphur; it has been suggested that this is an 
'ionization cone', possibly even due to an active galactic nucleus. We suggest that 
this emission is due to leakage of photons from the starburst, which are ionizing 
the surface of the infalling streamer. 

Streamer CO and dust properties and GTD. The CO(3–2) emission originates 
largely from a single cloud, Cloud 'C' (ref. 3). For its line strength of 
I₂₁ = 3.9 ± 0.6 K km s⁻¹, the ratio I₃₂/I₂₁ is 1.0 ± 0.3. RADEX models of the ratio 
are consistent with optically thick and cooler gas, T ~ 20 K, with 
n ~ 10°°–10⁴ cm⁻³ (Extended Data Fig. 3). The molecular mass of the streamer based 
on CO(2–1) (ref. 3) is M_H₂ = 2 × 10⁶ M☉ for a Galactic conversion factor, 
X_CO = 2 × 10²⁰ cm⁻² (K km s⁻¹)⁻¹. The virial mass is 3 × 10⁶ M☉ (ref. 3). The 
870-μm dust continuum emission follows the CO (Fig. 2). The 870-μm flux density of 
the streamer is 26 ± 8 mJy, which is all dust (Fig. 2). Adopting the dust opacity 
κ(870 μm) = 0.9 cm² g⁻¹ and dust temperature T_d = 20 K, we obtain a dust mass of 
M_dust = 2.6 × 10⁴ M☉. The streamer thus has GTD ~ 120 using the virial mass for 
the gas mass. If the H I gas is added, the total H I + H₂ mass becomes 
4.3 × 10⁶ M☉, which gives GTD = 170. 
That the streamer is molecular gas, and, even more surprisingly, dense molecular 
gas, is difficult to understand. Molecular gas favours high-pressure environments 
such as the midplanes of the central regions of spiral disks. Even though 
the filament seems to be in a low-pressure environment, it is not only molecular, 
but also dense. Models of the streamer as an example of a primordial cooling 
filament, in which the gas collapses towards the centre of the dark-matter 
potential, are able to produce the observed inflow rate of gas of 
~0.1–0.2 M☉ yr⁻¹ but are unable to reproduce the formation of the observed giant 
molecular clouds. The streamer may be previously enriched gas. 
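
For reference, the two streamer GTD values follow from the masses quoted in this paragraph. The lines below are a minimal sketch of that arithmetic, assuming a dust mass of 2.6 × 10⁴ M☉, a virial mass of 3 × 10⁶ M☉ and a combined H I + H₂ mass of 4.3 × 10⁶ M☉; the helper name is hypothetical.

```python
def gas_to_dust(gas_mass_msun, dust_mass_msun):
    """Gas-to-dust mass ratio."""
    return gas_mass_msun / dust_mass_msun


STREAMER_DUST_MSUN = 2.6e4

gtd_virial = gas_to_dust(3.0e6, STREAMER_DUST_MSUN)    # virial mass as the gas mass
gtd_with_hi = gas_to_dust(4.3e6, STREAMER_DUST_MSUN)   # atomic plus molecular gas

print(f"GTD (virial) ~ {gtd_virial:.0f}, GTD (H I + H2) ~ {gtd_with_hi:.0f}")
```

The ratios come out at about 115 and 165, which round to the ~120 and 170 given in the text.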

Cloud F. Cloud F, located about 5″ to the southwest of Cloud D, was not detected 
in previous CO observations. Clouds D and F are the only two detected giant 
molecular clouds within NGC 5253 proper. Using the Galactic CO conversion factor 
and assuming optically thick emission, for the observed flux of 17 ± 6 Jy km s⁻¹ 
we obtain a mass of M_H₂ ~ 4 × 10⁵ M☉ for Cloud F. 

Star-formation efficiency and cluster survival. The canonical value of η = 50% 
is from virial considerations for the survival of a bound cluster with mass loss on 
timescales less than the crossing time. It is possible for a cluster to survive with 
lower efficiency, to ~30%, if the gas is lost slowly and the cluster expands. 

Extended Data Figure 1 | Channel maps of CO(3–2) emission in NGC 5253. 
Positions are relative to a reference position 13 h 39 min 56.249 s, 
−31° 38′ 29″ (J2000). Channels are 10 km s⁻¹ wide; the heliocentric velocity is 
noted on the individual maps. The colour bar range maximum is 1 Jy per beam 
for each 10 km s⁻¹ channel. The beam is 4″ × 2″, p.a. 0°. 

Extended Data Figure 2 | SMA image of CO(3–2) with CO(2–1). 
CO(3–2) emission is shown in colour, with an image from the Owens Valley 
Millimeter Array of CO(2–1) in contours. The SMA CO(3–2) image has 
been smoothed from its original 4″ × 2″, p.a. 0° resolution to match the 
9.7″ × 5″, p.a. −84°, beam of the CO(2–1) image. The colour image flux 
range is 3–40 Jy km s⁻¹ per beam. Contours are linear multiples of 4 Jy km s⁻¹ 
per beam. 

Extended Data Figure 3 | RADEX modelling of Cloud D and streamer. 
Escape probability transfer modelling of the CO(3–2) to CO(2–1) line ratio. 
Models were run using a black-body radiation field, spherical escape 
probability and N_CO = 10'° cm⁻². a, Cloud D. Magenta is for the value 
I₃₂/I₂₁ = 2.25, the optically thin limit, and indigo is for the 1σ lower limit to 
the measured value of 2.6. There is no solution for I₃₂/I₂₁ = 2.6. b, Streamer. 
Magenta is for I₃₂/I₂₁ = 0.98, and indigo and green are solutions for the −1σ 
and +1σ values, respectively. 


©2015 Macmillan Publishers Limited. All rights reserved 



doi:10.1038/nature14214 


Ligand-enabled meta-C-H activation using a transient mediator 

Xiao-Chen Wang, Wei Gong, Li-Zhen Fang, Ru-Yi Zhu, Suhua Li, Keary M. Engle & Jin-Quan Yu 


Achieving site selectivity in C-H functionalization reactions is a 
significant challenge, especially when the target C-H bond is distant 
from existing functional groups. Coordination of a functional 
group to a metal is often a key driving force and control element 
in many important reactions including asymmetric hydrogenation, 
epoxidation and lithiation. Exploitation of this effect has led to the 
development of a broad range of directed C-H activation reactions. 
However, these C-H activation methods are limited to proximal C-H 
bonds, which are spatially and geometrically accessible from the 
directing functional group. The development of meta-selective C-H 
functionalizations remains a significant challenge. We recently 
developed a U-shaped template that can be used to overcome this 
constraint and have shown that it can be used to selectively activate 
remote meta-C-H bonds. Although this approach has proved to 
be applicable to various substrates and catalytic transformations, 
the need for a covalently attached, complex template is a substantial 
drawback for synthetic applications. Here we report an alternative 
approach employing norbornene as a transient mediator to achieve 
meta-selective C-H activation with a simple and common 
ortho-directing group. The use of a newly developed pyridine-based ligand 
is crucial for relaying the palladium catalyst to the meta position by 
norbornene after initial ortho-C-H activation. This catalytic reaction 
demonstrates the feasibility of switching ortho-selectivity to 
meta-selectivity in C-H activation of the same substrate by catalyst control. 
Despite the recent reports of U-shaped templates to direct meta-C-H 
activation (Fig. 1a), the development of more general and efficient 
approaches to achieve meta-C-H activation remains a significant task. 
Inspired by the unique reactivity of norbornene in palladium-catalysed 
reactions of aryl iodides (Fig. 1b), we speculated that ortho-palladacycle 
I (Fig. 1c) could react with norbornene to provide an intermediate that 
can undergo activation of the meta-C-H bond (intermediate II). 
Reaction of palladacycle II with a coupling partner would form a new C-C 
or C-heteroatom bond, and subsequent β-carbon elimination of 
norbornene followed by protodemetallation of the aryl-palladium bond 
would regenerate the palladium catalyst. This reaction pathway would 
therefore provide a new approach to achieving catalytic meta-selective 
C-H activation by using standard ortho-directing groups with 
norbornene as a transient mediator. We realized this concept through the design 
of a pyridine-based ligand that suppresses several undesired reaction 
pathways and promotes meta-selective alkylation and arylation of 
phenylacetic acid derivatives. We expect that this approach will enable 
many previously reported ortho-C-H activation reactions to be 
rendered highly meta-selective. 

Figure 1 | Design of a new approach for meta-C-H activation. 
a, Covalent template strategy for remote C-H activation: previously 
reported meta-C-H activation directed by a U-shaped template. 
b, ortho-Alkylation and ipso-olefination in the Catellani reaction: the 
norbornene-mediated Catellani reaction functionalizes ortho-positions of 
aryl iodides. c, Norbornene as a transient mediator for meta-C-H activation: 
the development of a tailor-made ligand that switches the ortho-selectivity to 
meta-selectivity using norbornene as a transient mediator (this work). 
Ar_F, 4-(CF₃)C₆F₄. The activated C-H bonds are shown in blue. 



At the outset, we were aware that our proposed meta-C-H activation 
with norbornene as a transient mediator would face several key 
challenges (Fig. 1c). First, it was unclear whether 1,2-migratory insertion 
with norbornene could occur from a cyclopalladated intermediate 
(I) and whether this pathway would be faster than the competitive reaction 
of the palladacycle with a terminal electrophile. Nevertheless, we 
were encouraged by recent reports of C2-selective C-H alkylation 
of indoles via initial aminopalladation of norbornene with indolyl N-H 
bonds. Second, the resultant intermediate II could undergo reductive 
elimination to form a benzocyclobutene, thereby preventing the desired 
meta-C-H functionalization (Fig. 1c). Third, rather than undergoing 
protodemetallation, intermediate III could be further functionalized to 
afford the undesired ortho,meta-difunctionalized products. 

Figure 2 | Discovery of a ligand that enables meta-C-H alkylation using 
norbornene as a transient mediator. a, Pyridine promotes the Pd-catalysed 
reaction of phenylacetic amide 1a with norbornene to afford benzocyclobutene 
via a Catellani pathway. DCE, 1,2-dichloroethane. b, A modified pyridine-type 
ligand enables selective meta-C-H methylation. Yields were determined by 
¹H NMR analysis of the crude reaction mixture. c, meta-C-H alkylation with 
ethyl iodoacetate. 

We began our investigation with a phenylacetic-acid-derived 
N-2,3,5,6-tetrafluoro-4-trifluoromethylphenyl amide substrate. This auxiliary 
has proved to be effective in the ortho-C-H activation of C(sp²)-H bonds 
by Pd(II) catalysts. To probe the feasibility of each step in the proposed 
catalytic cycle, we first treated substrate 1a with Pd(OAc)₂ (OAc = 
acetate) and norbornene under various conditions in an attempt to identify 
conditions for 1,2-migratory insertion, without success (see 
Supplementary Information). Although ortho-C-H activation of 1a does 
not require a ligand, our recent finding that 9-methylacridine promotes 
directed ortho-C-H activation prompted us to search for pyridine-type 
ligands that would facilitate the 1,2-migratory insertion. We found that 
the addition of pyridine as a ligand enabled the 1,2-migratory insertion 
of norbornene into the putative ortho-palladacycle intermediate. 
Subsequent meta-C-H palladation and reductive elimination presumably 
then took place to give benzocyclobutene 2a in 61% yield (Fig. 2a). 
Formation of 2a was not observed in the absence of pyridine ligand. 

Having validated the first key steps in the proposed catalytic cycle 
(ortho-C-H activation, 1,2-migratory insertion, and meta-C-H activation), 
we next sought to identify conditions to prevent benzocyclobutene 
formation and promote meta-functionalization with an electrophile. 
In the presence of methyl iodide, the desired meta-methylated product 
3a was obtained in 15% yield, and the yield of 2a decreased to 15% 
(Fig. 2b, L1). Further screening of simple pyridine ligands revealed that 
alkyl substitution at the 2-position improved the yield of meta-methylation 
(L2 and L5, 29% and 37% yield, respectively). We further found that 
pyridine ligands with electron-donating substituents gave improved 
product selectivity (L6-L9), whereas electron-deficient pyridine ligands 
provided both poor selectivity and reactivity (L10-L12). Prompted by 
our recent progress in developing quinoline-based ligands for C-H 
activation, we subsequently examined this class of ligands. Although 
the use of 2-methylquinoline (L14) and acridine (L15) as ligands led to 
increased reactivity, formation of benzocyclobutene 2a was favoured. 
Because electron-donating substituents enhance selectivity for the 
meta-methylation pathway, we tested several quinoline ligands containing 
alkoxy groups at the 2-position. Although 2-(isobutoxy)quinoline (L16) 
was an ineffective ligand, the use of ligand L17, containing a 2-alkoxy 
group with a constrained conformation in a fused ring, led to full 
conversion, affording 3a in 80% yield. Installation of a highly 
electron-donating amine substituent on a similar ligand L18 did not improve the 
selectivity. Ligand L19, containing a fused cyclopentane ring, further 
improved the selectivity for meta-methylation, providing 3a in 85% 
yield. Selective meta-alkylation with ethyl iodoacetate was also achieved 
under optimized conditions to give the desired product 4a in 67% yield 
(Fig. 2c). 

The scope of this norbornene-mediated meta-alkylation was investigated, 
using methyl iodide (3a-3x) and ethyl iodoacetate (4a-4x) as 
coupling partners (Fig. 3). Electron-donating and electron-withdrawing 
ortho-substituents were tolerated in this transformation, providing 
moderate to excellent yields (3a-3h and 4a-4h). An ortho-bromide 
substituent was also compatible with this protocol (3d and 4d). Alkylation 
of 1-naphthylacetic amide 1i occurred exclusively at the 3-position, 
rather than at the 8-position (3i and 4i). When an unsubstituted amide 
substrate 1j was used, methylation occurred at both meta positions to 
provide dimethylated product 3j in 80% yield. Coupling of ethyl 
iodoacetate with this substrate, however, mainly produced the 
benzocyclobutene by-product; the yield of the desired dialkylated product 4j was 
less than 10%. This result demonstrates the subtle reactivity differences 
between these two alkylating reagents. Substrates bearing 
electron-donating or electron-withdrawing meta-substituents were also reactive 
in this protocol (3k-3n and 4k-4n). Methylation of the more sterically 
hindered p-MeO-substituted substrate 1o was sluggish and poor yielding, 
providing 3o in less than 10% yield. Mono-alkylation of 1o with 
ethyl iodoacetate afforded 4o in 68% yield. A substrate containing the 
less sterically bulky p-F group reacted well with both coupling partners 
to afford the dialkylated products 3p and 4p in 85% and 59% yields, 
respectively. The methylation reaction proceeded effectively with a broad 
range of α-substituents, including methyl, acetoxyl, benzyloxy and 
phthaloyl-protected amino groups to give the dimethylated products 
in good yields (3q-3t). Thus, this transformation enabled the direct 
meta-functionalization of key biologically active building blocks, such 
as mandelic acid (3r and 3s) and phenylglycine (3t). In contrast, the 
ethoxycarbonylmethylation reaction was effective only for 1s, containing 
a benzyloxy group at the benzylic position (4s). Both alkylation reactions 
were compatible with tetrahydronaphthalene (1u) and tetralone 
(1v) substrates, despite the amide directing groups being out of plane 
with the target C(aryl)-H bonds. Meta-methylation was effective for 
the heterocyclic dihydrobenzofuran-containing and indoline-containing 
substrates 1w and 1x. However, ethoxycarbonylmethylation of these 
heterocycles provided only trace product. 

Figure 3 | meta-C-H alkylation of phenylacetic amides. R-I, methyl iodide 
or ethyl iodoacetate; X, Y, generic substituents; Ac, acetyl; Bn, benzyl; Phth, 
phthaloyl; Ts, 4-toluenesulfonyl. The values under each structure indicate 
isolated yields. For 1j and 1p-1t, dialkylation is observed; reaction conditions: 
20 mol% Pd(OAc)₂, 40 mol% L19, 5 equiv. R-I, 5 equiv. AgOAc, 3 equiv. 
norbornene. *, 15 mol% Pd(OAc)₂, 30 mol% L19. †, 20 mol% Pd(OAc)₂, 
40 mol% L19. 
Conditions shown in the scheme: Pd(OAc)₂ (10 mol%), L19 (20 mol%), 
norbornene (1.5 equiv.), AgOAc (3 equiv.), DCE, 95 °C, 16 h. Products: 
3a-3x (R = Me) and 4a-4x (R = CH₂CO₂Et). 
Yields legible in the figure: 3d, 88%; 4d, 81%; 3e, 86%; 4e, 68%; 3f, 72%; 
4f, 75%*; 3j, 80%†; 4j, <10%†; 3k, 72%†; 4k, 51%†; 3l, 90%*; 4l, 86%; 
3p, 85%†; 4p, 59%†; 3q, 72%†; 4q, <10%†; 3r, 75%†; 4r, <10%†; 
3v, 71%*; 4v, 69%*; 3w, 81%†; 4w, <10%; 3x, 73%†; 4x, <10%. 

Figure 4 | Scope of organohalide coupling partners. a, meta-C-H alkylation 
with alkyl halides. Alkyl-X, iodoethane or benzyl bromides. b, meta-C-H 
arylation with aryl iodides. TBME, t-butyl methyl ether. The values under 
each structure indicate isolated yields. *, 15 mol% Pd(OAc)₂, 30 mol% L19. 
†, Dichloromethane as solvent. ‡, 6 equiv. Ar-I; AgOPiv (OPiv = pivalate) used 
instead of AgOAc. 

We subsequently examined the generality of this meta-C-H func- 
tionalization strategy by investigating additional organohalide coupling 
partners (Fig. 4). Ethylation with ethyl iodide under standard condi- 
tions led to exclusive formation of the undesired benzocyclobutene 
by-product. Increasing the loading of ethyl iodide (6 equiv.) afforded 
meta-ethylated product 5 in 21% yield, with the benzocyclobutene still 
being formed as the major product. Meta-alkylation with benzyl bro- 
mide, a more reactive electrophile, proceeded smoothly to afford 6a in 
66% yield. Both electron-donating and electron-withdrawing substi- 
tuents on the ortho, meta and para positions of the benzyl bromide 
coupling partner were tolerated, providing meta-benzylated products 
6b-6g in 60-75% yield. 

Finally, we examined whether this approach for meta-C-H functio- 
nalization could be adapted to achieve meta-C-H arylation, thereby 
providing a method for the synthesis of biaryl compounds. Arylation 
with methyl 2-iodobenzoate in t-butyl methyl ether (TBME) provided the 
meta-arylated product 7a in 64% yield. The use of methyl 3-iodobenzoate 
or methyl 4-iodobenzoate, however, resulted in the formation of the 
benzocyclobutene adduct as the major product. These experiments 
suggest that the presence of an ortho-coordinating group is required 
to promote oxidative addition of the aryl halide, which is consistent 
with previous studies. We therefore examined other substituted 
2-iodobenzoates and were able to obtain the desired products in mod- 
erate yields (7b-7d). Similarly, 2'-iodoacetophenone was a competent 
coupling partner, affording 7e in 48% yield. Arylation with highly reac- 
tive, electron-deficient 3,5-bis(trifluoromethyl)iodobenzene was also 
effective, providing the meta-arylated product 7f in 76% yield. The 
successful meta-arylation of substrates containing dihydrobenzofu- 
ran (8, 82% yield) and indoline (9, 69% yield) groups with methyl 
2-iodobenzoate is especially noteworthy in terms of demonstrating late- 
stage diversification of heterocycles with possible applications in medi- 
cinal chemistry. We expect that further development of the ancillary 
pyridine ligand will broaden the scope of alkyl and aryl halides in this 
reaction, ultimately leading to a broadly useful method for meta-C-H 
functionalization using common ortho-directing groups and norbornene 
as a transient mediator. 

Observational determination of surface radiative forcing by CO₂ from 2000 to 2010 

The climatic impact of CO₂ and other greenhouse gases is usually 
quantified in terms of radiative forcing, calculated as the difference between 
estimates of the Earth's radiation field from pre-industrial and 
present-day concentrations of these gases. Radiative transfer models calculate 
that the increase in CO₂ since 1750 corresponds to a global 
annual-mean radiative forcing at the tropopause of 1.82 ± 0.19 W m⁻² (ref. 2). 
However, despite widespread scientific discussion and modelling of 
the climate impacts of well-mixed greenhouse gases, there is little 
direct observational evidence of the radiative impact of increasing 
atmospheric CO₂. Here we present observationally based evidence of 
clear-sky CO₂ surface radiative forcing that is directly attributable to 
the increase, between 2000 and 2010, of 22 parts per million 
atmospheric CO₂. The time series of this forcing at the two locations (the 
Southern Great Plains and the North Slope of Alaska) are derived from 
Atmospheric Emitted Radiance Interferometer spectra together with 
ancillary measurements and thoroughly corroborated radiative transfer 
calculations. The time series both show statistically significant trends 
of 0.2 W m⁻² per decade (with respective uncertainties of ±0.06 W m⁻² 
per decade and ±0.07 W m⁻² per decade) and have seasonal ranges 
of 0.1–0.2 W m⁻². This is approximately ten per cent of the trend in 
downwelling longwave radiation. These results confirm theoretical 
predictions of the atmospheric greenhouse effect due to anthropogenic 
emissions, and provide empirical evidence of how rising CO₂ levels, 
mediated by temporal variations due to photosynthesis and 
respiration, are affecting the surface energy balance. 

Even though Northern Hemisphere atmospheric CO2 mixing ratios 
have recently exceeded 400 parts per million (ppm), few investigations 
have directly explored the effect of changes in well-mixed greenhouse 
gases on spectral and broadband radiation fluxes. In fact, doing so has 
proved difficult. Using data from two satellite instruments launched 26 
years apart, Harries et al. attributed systematically decreased emission 
in CO2 and CH4 spectral bands to increased opacity from, and rising 
concentrations of, these well-mixed greenhouse gases. However, this effort 
was complicated by uncertainties in instrument performance, short 
measurement records from each instrument, and cloud contamination. 

In principle, CO2 forcing can be predicted from knowledge of the 
atmospheric state assuming exact spectroscopy and accurate radiative 
transfer. Forcing can then be estimated using radiative transfer 
calculations with atmospheric temperature, the concentrations of radiatively 
active constituents including water vapour, O3, CH4, N2O, and less 
prominent well-mixed greenhouse gases, and changes in CO2. However, 
experimental validation of this forcing is needed outside the laboratory 
because CO2 spectroscopy is an area of active research. Furthermore, 
the fast radiative-transfer algorithms that drive regional and global 
climate models approximate spectroscopic absorption line-by-line 
calculations with errors of about 0.6 W m⁻² (ref. 14), an amount comparable 
to the forcing by anthropogenic CH4 and N2O. 

Surface forcing represents a complementary, underutilized resource 
with which to quantify the effects of rising CO2 concentrations on 
downwelling longwave radiation. This quantity is distinct from 
stratosphere-adjusted radiative forcing at the tropopause, but both are fundamental 
measures of energy imbalance caused by well-mixed greenhouse gases. 
The former is less than, but proportional to, the latter owing to 
tropospheric adjustments of sensible and latent heat, and is a useful metric 
for localized aspects of climate response. We focus here on clear-sky flux 
changes because models predict most of the CO2 surface forcing to occur 
under clear-sky conditions. 

Specialized atmospheric observations made by the US Department of 
Energy Atmospheric Radiation Measurement (ARM) programme at 
experimental sites in the mid-latitude continental Southern Great Plains 
(SGP) and on the Arctic marine North Slope of Alaska (NSA) 
produce the integrated data sets required for an independent diagnosis 
of the surface radiative effects of CO2. We used spectroscopic 
measurements from the Atmospheric Emitted Radiance Interferometer (AERI) 
instrument and atmospheric state data at these two sites to test whether 
the impact of rising CO2 on downwelling longwave radiation can be 
rigorously detected. By basing this analysis on sets of independent 
measurements at high temporal frequency over long duration, we 
empirically established how anthropogenic emissions, mediated by variations 
in photosynthetic activity and respiration, are altering the Earth’s surface 
energy balance. 

However, AERI spectral measurements and trends are sensitive to 
many different components of the atmospheric state. To interpret these 
measurements and attribute specific signals to rising CO2 requires an 
accurate radiative-transfer model that reproduces these spectra on the basis of an 
independent assessment of the state of the atmosphere. The model must 
capture instantaneous signals and long-term trends in the spectra to 
determine the effects of CO2 on diurnal to decadal timescales. 

We used the Line-by-Line Radiative Transfer Model (LBLRTM; ref. 18), 
which is continuously compared against other line-by-line models 
and observations. A sample clear-sky measured AERI spectrum is shown 
in Fig. 1a. Figure 1b shows residual spectra produced from the 
measurement (‘obs’), minus spectra calculated (‘calc’) using (1) CO2 concentrations 
from CarbonTracker 2011 (CT2011), which is a greenhouse gas 
assimilation system based on measurements and modelled emission 
and transport; (2) methane (CH4) profiles from CarbonTracker-CH4 
(ref. 21); (3) ozone (O3) profiles from NASA’s Modern-Era Retrospective 
Analysis for Research and Applications (MERRA); and (4) temperature 
and water-vapour profiles from radiosondes (see Methods). The measured 
spectrum in Fig. 1a shows Planck function behaviour near the centre of the 
fundamental (ν2) CO2 band and exhibits a departure from a Planck curve 
in the P- and R-branches of this feature, indicating that the emission in 
these branches is sub-saturated and could increase with increasing CO2. 
Water-vapour features, continuum emission, and O3 emission are seen 
in the infrared window between 800 cm⁻¹ and 1,200 cm⁻¹, and lesser 
features from CH4 are seen around 1,300 cm⁻¹. Calculated transmission 
and the change in transmission with a 22 ppm CO2 increase are also shown, 
indicating that weak vibration-rotation features in the far wings of the 
fundamental and in the infrared window dominate surface radiative forcing 
from rising CO2. 

Figure 1 | AERI spectrum and residual features. a, Sample clear-sky AERI 
(channel 1) spectrum measured at SGP on 14 March 2001 2330Z, transmission 
calculation (Trans.) from LBLRTM, and the difference in transmission (ΔTrans.) 
calculated for a 22 ppm change in column-averaged CO2 (370-392 ppm). b, The 
instantaneous spectral residual (obs − calc) (blue) and a spectral residual from 
observations for March 2001 (red). The inset indicates the running average of 
spectral residual rms of AERI Channel 01 (520-1,800 cm⁻¹) and the CO2 
R-branch (690-750 cm⁻¹). 

The agreement between a single measured spectrum and the LBLRTM 
calculation (that is, the residual) is generally within 1.5 mW m⁻² sr⁻¹ 
per cm⁻¹, with notable exceptions. These include the region below 
550 cm⁻¹ where instrumental thermal noise is present, the centre of the 
ν2 CO2 absorption feature, which is sensitive to the temperature within 
the instrument container, and sporadic features between 1,400 cm⁻¹ and 
1,800 cm⁻¹ that are due to opaque water vapour lines. Nevertheless, over 
90% of the residual is within a 3σ envelope established by the noise-effective 
radiance of 0.2 mW m⁻² sr⁻¹ per cm⁻¹. Noise in the residual spectrum 
does not preclude long-term spectral analysis because it has no 
long-term bias. As highlighted in the inset of Fig. 1, the spectral root mean 
square (rms) of the residual decreases with the length of time over which 
the spectra are averaged, with asymptotes at 0.3 mW m⁻² sr⁻¹ per cm⁻¹ 
and 0.1 mW m⁻² sr⁻¹ per cm⁻¹ across the AERI band-pass and in the 
CO2 R-branch, respectively. 

Over the length of the observation period (2000-2010), the 
modelled spectra at both SGP and NSA are dominated by trends associated 
with the temperature and humidity structure of the atmosphere rather 
than the smaller signal from CO2. The seasonal and annual trends in 
calculated clear-sky spectra at SGP (Fig. 2a) and NSA (Fig. 2d) are 
dominated by changes in the atmospheric thermodynamic state and 
are of opposite sign depending on the season. These signals arise from 
seasonally dependent clear-sky trends in temperature profiles and water 
vapour concentrations, as determined by radiosondes (see Methods), 
and must be taken into account to determine the forcing from CO2. We 
therefore construct counterfactual spectra (such spectra are produced 
from models that keep the CO2 concentration fixed) to simulate spectra 
with time-invariant CO2, whereby we use temperature and water-vapour 
estimates from concurrent radiosondes to remove the thermodynamically 
derived radiometric signals from AERI spectra and isolate the signature 
of CO2. Since most CO2 surface forcing occurs in the absence of clouds, 
we focus on clear-sky conditions, identified using the Active 
Remote-Sensing of Clouds (ARSCL) Value-Added Product data set. 
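
The clear-sky screening and counterfactual differencing described above can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the analysis code: the function name `clear_sky_residuals`, the dictionary layout of each sample and all numbers are hypothetical.

```python
def clear_sky_residuals(samples):
    """Return obs - calc residual spectra for cloud-free samples only.

    Each sample is a dict (hypothetical layout) with:
      'obs'         : measured radiances, one value per spectral channel
      'calc_fixed'  : counterfactual radiances computed with CO2 held fixed
      'cloud_masks' : booleans, True where a cloud mask reports cloud
    """
    residuals = []
    for s in samples:
        if any(s["cloud_masks"]):      # keep only scenes clear in every mask
            continue
        residuals.append([o - c for o, c in zip(s["obs"], s["calc_fixed"])])
    return residuals


# Toy input: two spectra with three channels each (made-up numbers).
samples = [
    {"obs": [50.0, 61.5, 40.2], "calc_fixed": [50.0, 61.0, 40.0],
     "cloud_masks": [False, False, False]},
    {"obs": [70.0, 80.0, 65.0], "calc_fixed": [50.1, 61.1, 40.1],
     "cloud_masks": [False, True, False]},   # cloudy in one mask: rejected
]
residuals = clear_sky_residuals(samples)
```

Only the first toy spectrum survives the screening, so `residuals` holds a single residual spectrum.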

Differences between counterfactual spectra and coincident AERI 
measurements show structure in the major CO2 absorption features, as 
shown in Fig. 2b and e, at an order of magnitude greater than the 
long-term residual rms in Fig. 1b. Also shown are those spectral features for 
which the trend is non-zero at the 3σ level. These panels show the 
unmistakable spectral fingerprint of CO2. The trends in forcing are significantly 
(P < 0.003) different from zero only in the P- and R-branches of the ν2 
CO2 band. 

We can exclude alternative explanations for the change in these 
measurements, such as instrument calibration or the temperature, water 
vapour, or condensate structure of the atmosphere, because they would 
produce significant (P < 0.003) trends in other spectral regions outside 
the CO2 absorption bands (see Fig. 2b and e). Moreover, the spectral forcing 
from CO2 is a strong function of changes in the CO2 column concentration, 
and nonlinear interactions between temperature and water vapour were 
weak, as indicated by the lack of statistically significant differences in 
the seasonal and annual spectral trends in the CO2 P- and R-branches. 
Therefore, the atmospheric structure of temperature and water vapour 
does not strongly affect CO2 surface forcing, which is consistent with 
the findings of others. 

Finally, when we compare counterfactual spectra with radiative 
transfer calculations where CO2 is derived from CT2011, as shown in Fig. 2c 
and f, we again find spectral structure only in the CO2 absorption 
features, thereby confirming that the model is a reliable tool for 
determining the surface forcing from CO2. The utility of the counterfactual 
approach is further demonstrated by the probability distribution 
function of the difference in the rms spectral residual of AERI and LBLRTM 
with time-varying and fixed CO2 concentrations for 2010 (Fig. 3). The 
two probability distribution functions differ substantially (two-sided 
t-test, P < 0.00001). The distributions of the mean residuals also differ 
significantly (two-sided t-test, P < 0.00001). 
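
A two-sample comparison of this kind can be illustrated with a short sketch. The text does not say which form of the t-test was used, so the unequal-variance (Welch) form below is an assumption, and the two samples are made-up numbers standing in for rms residuals. The sketch computes only the test statistic and its approximate degrees of freedom; converting these to a P value needs the t distribution, which the Python standard library does not provide.

```python
import math
import statistics


def welch_t(sample_a, sample_b):
    """Welch's unequal-variance t statistic and approximate degrees of freedom."""
    na, nb = len(sample_a), len(sample_b)
    va = statistics.variance(sample_a) / na   # squared standard error of mean a
    vb = statistics.variance(sample_b) / nb   # squared standard error of mean b
    t = (statistics.mean(sample_a) - statistics.mean(sample_b)) / math.sqrt(va + vb)
    # Welch-Satterthwaite approximation for the degrees of freedom.
    dof = (va + vb) ** 2 / (va ** 2 / (na - 1) + vb ** 2 / (nb - 1))
    return t, dof


# Hypothetical rms residuals for the time-varying and fixed-CO2 calculations.
rms_time_varying = [1.0, 2.0, 3.0, 4.0, 5.0]
rms_fixed = [2.0, 3.0, 4.0, 5.0, 6.0]
t_stat, dof = welch_t(rms_time_varying, rms_fixed)
```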
Figure 2 | Measured and modelled spectral trends for 2000-2010. 
a, Calculated (simulated) SGP AERI clear-sky seasonal and annual spectral 
radiance trends for 2000-2010 with temperature, water vapour, and CO2 
changes. b, SGP AERI annually averaged clear-sky spectral residual trends, 
where the residual is the difference between the measurement and a 
radiative-transfer calculation with CO2 fixed at 370 ppm (in black). Red overlay 
indicates channel trends 3σ different from zero. c, As for b but showing residual 
trends from differencing calculations based on CT2011 CO2 concentrations 
and CO2 fixed at 370 ppm. d, As for a but for NSA. e, As for b but for NSA. 
f, As for c but for NSA. 

Figure 3 | Distributions of residual rms values in 2010. Histograms of rms 
spectral residuals (520-1,800 cm⁻¹) at SGP with time-varying CO2 (dark grey) 
and a uniform 370 ppm CO2 profile (light grey). Each count corresponds to 
a separate spectrum measured by AERI in 2000 or 2010. 

The time series of CO2 surface forcing, derived from differencing AERI 
measurements and counterfactual calculations at the SGP (Fig. 4a) and 
spectrally integrating and converting to flux, shows clear and 
increasing trends in radiative surface forcing and seasonal variability. The 
least-squares trend in the long-term forcing is 0.2 ± 0.06 W m⁻² per 
decade and differs significantly (P < 0.003) from zero. The seasonal 
amplitude of the forcing is 0.1-0.2 W m⁻², closely tracking the 
independently assessed pattern in the average CO2 concentration in the 
lowest 2 km of the atmosphere. The variation in the power spectral 
density function of surface forcing with frequency (Fig. 4b) shows the 
largest peak associated with springtime photosynthesis and autumn 
respiration. 
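
How a seasonal cycle shows up as a peak in a power spectrum can be sketched with a direct discrete Fourier transform on a synthetic monthly series. Everything here is illustrative: the series is made up (a linear trend plus an annual sine), the helper names `detrend` and `periodogram` are hypothetical, and the spectral estimator behind Fig. 4b is not described in the text.

```python
import cmath
import math


def detrend(series):
    """Remove the least-squares straight line from an evenly sampled series."""
    n = len(series)
    t_mean = (n - 1) / 2.0
    y_mean = sum(series) / n
    sxx = sum((t - t_mean) ** 2 for t in range(n))
    sxy = sum((t - t_mean) * (y - y_mean) for t, y in enumerate(series))
    slope = sxy / sxx
    return [y - y_mean - slope * (t - t_mean) for t, y in enumerate(series)]


def periodogram(series, dt_years):
    """Return (frequencies in cycles per year, power) from a direct DFT."""
    x = detrend(series)
    n = len(x)
    freqs, power = [], []
    for k in range(1, n // 2 + 1):            # positive frequencies only
        coeff = sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
        freqs.append(k / (n * dt_years))
        power.append(abs(coeff) ** 2 / n)
    return freqs, power


# Synthetic monthly forcing: 11 years, linear trend plus an annual cycle.
forcing = [0.02 * (m / 12.0) + 0.08 * math.sin(2 * math.pi * m / 12.0)
           for m in range(132)]
freqs, power = periodogram(forcing, dt_years=1.0 / 12.0)
peak_frequency = freqs[power.index(max(power))]   # cycles per year
```

For this synthetic input the largest peak sits at one cycle per year, which is the signature of an annual cycle.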

The time-series for NSA (Fig. 4c) also has a pronounced seasonal 
cycle and secular trend. The range of CO2 surface forcing is similar to 
that of SGP, although the higher frequency variability is much less 
prominent. In addition, the time series of surface forcing at NSA shows 
increasing variability in the latter part of the 11-year analysis record. 
This variability results from increased numbers of samples and fewer 
outages (when not all of the necessary data streams are available to 
derive the CO2 forcing) at NSA from 2004 onwards. Nevertheless, the 
least-squares trend in CO2 surface forcing at NSA is also 0.2 ± 0.07 W m⁻² 
per decade, with a seasonal range of 0.1 W m⁻², and differs significantly 
from zero (P < 0.02). 

Increasing atmospheric CO2 concentrations between 2000 and 2010 
have led to increases in clear-sky surface radiative forcing of over 
0.2 W m⁻² at mid- and high-latitudes. Fossil fuel emissions and fires 
contributed substantially to the observed increase. The climate 
perturbation from this surface forcing will be larger than the observed effect, 
since it has been found that the water-vapour feedback enhances 
greenhouse gas forcing at the surface by a factor of three and will increase, 
largely owing to thermodynamic constraints. The evolving roles of 
atmospheric constituents, including water vapour and CO2 (ref. 30), in 
their radiative contributions to the surface energy balance can be tracked 
with surface spectroscopic measurements from stand-alone (or networks 
of) AERI instruments. If CO2 concentrations continue to increase at the 
current mean annual rate of 2.1 ppm per year, these spectroscopic 
measurements will continue to provide robust evidence of radiative 
perturbations to the Earth’s surface energy budget due to anthropogenic climate 
change, but mediated by annual variations in photosynthetic activity. 
These perturbations will probably influence other energy fluxes and key 
properties of the Earth’s surface and should be explored further. 

integrated (520-1,800 cm⁻¹) CO2 surface radiative forcing at SGP (in red) 

METHODS 


The methodology for this investigation focused on analysing time series of 
downwelling infrared spectra to determine the effects of CO2 on these measurements 
and thereby to estimate its surface radiative forcing. The analysis also used 
temporally coincident measurements of atmospheric temperature and water vapour, 
retrievals of cloud occurrence, and assimilation estimates of CO2 to construct 
simulated counterfactual measurements where CO2 is held fixed. 

Code and data availability. The measurement data sets used for this analysis are 
freely available through the ARM data repository (http://www.arm.gov). The 
radiative transfer codes are also freely available at http://rtweb.aer.com. CarbonTracker 
results were provided by NOAA/ESRL (http://carbontracker.noaa.gov). 
CarbonTracker-CH4 results were provided by NOAA/ESRL 
(http://www.esrl.noaa.gov/gmd/ccgg/carbontracker-ch4/). The MERRA data used in this 
analysis are freely available for download at 
ftp://goldsmr3.sci.gsfc.nasa.gov/data/s4pa/MERRA/MAI3CPASM.5.2.0/. 
The Broadband Heating Rate Profile (BBHRP) data files, used to assess 
fair-weather bias, are freely available on the ARM BBHRP web page at 
http://www.arm.gov/data/eval/24 under http://dx.doi.org/10.5439/1163296 for the 
time-varying data stream and http://dx.doi.org/10.5439/1163285 for the fixed CO2 
data stream. The computer routines used in this analysis will be made available upon request. 
Schematic. The schematic (Extended Data Fig. 1) shows how CO2 surface forcing 
is derived from differencing Atmospheric Emitted Radiance Interferometer (AERI) 
observations with a model calculation based on coincidental temperature and 
water vapour profiles. This produces a radiance difference (called a spectral 
residual) that is converted to flux units through a conversion factor calculated with a 
radiative-transfer model based on local meteorological conditions. The spectral 
residual, which is driven by observations, shows features only in the CO2 
absorption bands, meaning that this is a signal just from CO2. The integration of forcing 
over frequency gives a broadband forcing term and contains a secular trend due 
largely to anthropogenic emissions. 

Data sets. Spectra observed by the AERI instrument channel 1 were analysed over 
the period from 2000 through the end of 2010 at the ARM SGP and NSA sites. The 
AERI instrument contains two channels that cover a spectral range of 520-3,020 cm⁻¹ 
(19.2-3.3 µm) with a resolution of 0.5 cm⁻¹. Calibrated spectra are recorded every 
8 min. Channel 1 observes the 520-1,800 cm⁻¹ spectral region, which covers at 
least 99.97% of the longwave CO2 surface forcing. The absolute calibration is 
traceable to NIST standards and has been transferred from blackbodies with 
uncertainties better than 0.05 K (3σ). The resulting combined absolute uncertainty 
of each AERI spectrum is 1% (3σ). A summary of the AERI instrument’s spectral 
stability can be seen in Extended Data Fig. 2, which shows the time series of effective 
laser wavenumber that is used to define the instrument’s spectral scale. This time 
series exhibits a trend in the data of 4 ppm (relative) per decade around the 
nominal 15,799 cm⁻¹ laser wavenumber. Given that the uncertainty in the AERI spectral 
calibration is ±5 ppm (relative) (3σ), we have confidence that there is no statistically 
significant drift in the AERI spectral calibration over the course of the observation 
period (2000-2010). 
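
The unit conversions quoted in this paragraph can be checked with a few lines of arithmetic. The sketch below is illustrative only; the function and variable names are hypothetical, and the numbers are those given in the text.

```python
def wavenumber_to_micrometres(wavenumber_cm1):
    """Convert wavenumber (cm^-1) to wavelength (micrometres); 1 cm = 1e4 um."""
    return 1.0e4 / wavenumber_cm1


# AERI band edges quoted in the text, in cm^-1.
band_edges_cm1 = (520.0, 3020.0)
band_edges_um = [wavenumber_to_micrometres(v) for v in band_edges_cm1]

# Relative (ppm) quantities expressed in cm^-1 at the nominal laser wavenumber.
laser_cm1 = 15799.0
drift_cm1_per_decade = 4e-6 * laser_cm1          # 4 ppm (relative) per decade
calibration_uncertainty_cm1 = 5e-6 * laser_cm1   # 5 ppm (relative), 3-sigma
```

The band edges come out at about 19.2 µm and 3.3 µm, and the decadal drift of the laser wavenumber (about 0.06 cm⁻¹) is smaller than the calibration uncertainty (about 0.08 cm⁻¹), which is the basis for the statement that no significant drift is present.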

Although the AERI record extends from 1995 to the present, temporally resolved 
estimates of the CO2 concentration profiles from CT2011, which were necessary 
for validation of the radiative transfer model (see below), were only available 
beginning in 2000 and extending through 2010, resulting in an analysis period 
of 11 years. Overall, the AERI is a very robust instrument that inherently produces 
continuous, reliable operational data. Quality control for the spectra was achieved 
by ensuring that valid sky spectra were being observed by the instrument with the 
hatch open, and that unphysical radiance values, anomalously low variances in 
brightness temperature across the infrared window band (800-1,200 cm⁻¹) and 
scenes with substantial variability in the view of the hot blackbody were removed. 

CO2 profile best estimates were obtained from CT2011, which reported eight 
daily profiles in the vicinity of, and which were spatially interpolated to, the SGP 
and NSA sites at the European Centre for Medium-Range Weather Forecasts 
forecast model levels on a 40° latitude by 66° longitude grid covering North 
America and may have spin-up effects. The CT2011 assimilates data from many 
sources, including the average 12:00 to 16:00 local time CO2 in situ measurement 
from the ARM Precision Carbon Dioxide Mixing Ratio System tower at 60 m 
altitude at SGP. Comparisons of CT2011 against aircraft profiles collected by 
the ARM Carbon Measurement Experiment at SGP, which are not assimilated 
into CT2011, exhibit a bias of less than 0.5 ppm up to 3,600 m above sea level, with a 
standard deviation ranging from 1 ppm in winter to 3 ppm in summer. Between 
2000 and 2011, the global CO2 concentration at the surface increased from about 
369 ppm to about 392 ppm, as measured by ARM-NOAA Earth System Research 
Laboratory (ESRL) flasks, while the CO2 concentrations at SGP and NSA increased 
at 1.95 ppm versus 1.98 ppm per decade, respectively. Time series of CT2011 
profiles at SGP and fossil fuel and fire components are shown in Extended Data Fig. 3. 
Output from the CT2011 data set was used to create a baseline CO2 value of 
370 ppm from which surface-forcing calculations were determined. These outputs 
enabled us to demonstrate that spectral residuals derived from the Line-By-Line 
Radiative Transfer Model (LBLRTM) version 12.2 (ref. 18) with CT2011 had a 
lower rms difference than spectral residuals derived from the CT2011 baseline. 

Temperature and humidity profiles were gathered from the ARM 
Balloon-Borne Sounding System (BBSS). At the SGP site, radiosondes were launched four 
times per day throughout the period of analysis, while at the NSA site, radiosondes 
were launched two times per day over the analysis period. Three different 
sounding technologies have been used at both locations. At the SGP site, the Vaisala 
RS-80 technology was used until April 2001; the Vaisala RS-90 was used between May 
2001 and February 2005; and the Vaisala RS-92 was used from February 2005 to 
present. At the NSA site, RS-80 technology was used through April 2002, the RS-90 
was used from April 2002 until January 2005, and the RS-92 technology has been used 
since January 2005. To account for the dry bias that has been exhibited particularly 
by the RS-80 sensors, we scaled the radiosonde-reported humidity profile by the 
ratio of the total column water retrieved from the microwave radiometer to the 
radiosonde-derived total column water. We estimate that the corrected 
radiosonde uncertainty is under 0.5 °C and 5% relative humidity (3σ), given the 
technology’s stated precision and accuracy specifications. The change in radiosonde 
technology in the middle of the investigation could greatly complicate this 
analysis, and we would expect that a change in technology could introduce a 
jump-discontinuity in the forcing. The technology change occurred near the start of the 
analysis period in 2001, and the water-column amounts across the technology 
change differ at the P = 0.1 level for a two-sided t-test. However, we analysed the 
probability distribution function of the microwave radiometer to BBSS column 
water vapour both before and after the RS-90 to RS-92 change and did not find 
statistically significant differences (Extended Data Fig. 4). 
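
The humidity scaling described above amounts to multiplying the radiosonde profile by a single ratio. The sketch below illustrates the idea under stated assumptions: the humidity profile is taken to be specific humidity (kg/kg) on pressure levels, the column is integrated with the trapezoid rule, and the function names and all numbers are hypothetical rather than taken from the analysis.

```python
def column_water(specific_humidity, pressure_pa, g=9.80665):
    """Water-vapour column (kg m^-2): (1/g) * integral of q dp, trapezoid rule.

    specific_humidity : q at each level (kg/kg)
    pressure_pa       : pressure at each level (Pa), decreasing with height
    """
    total = 0.0
    for i in range(len(pressure_pa) - 1):
        dp = pressure_pa[i] - pressure_pa[i + 1]
        total += 0.5 * (specific_humidity[i] + specific_humidity[i + 1]) * dp
    return total / g


def scale_to_radiometer(specific_humidity, pressure_pa, radiometer_column):
    """Scale the sonde profile so that its column matches the radiometer value."""
    sonde_column = column_water(specific_humidity, pressure_pa)
    factor = radiometer_column / sonde_column
    return [q * factor for q in specific_humidity], factor


# Made-up sounding and microwave-radiometer column (kg m^-2).
pressure = [100000.0, 85000.0, 70000.0, 50000.0, 30000.0]
humidity = [0.010, 0.007, 0.004, 0.001, 0.0001]
scaled, factor = scale_to_radiometer(humidity, pressure, radiometer_column=29.0)
```

In this toy case the sonde column is about 27.6 kg m⁻², so the factor is slightly above one, as would be expected for a sonde with a dry bias.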

Cloud clearing, which was essential to this analysis, was achieved through the 
ARM ARSCL Value-Added Product, which combines the Micropulse Lidar, 
Millimeter Wavelength Cloud Radar, and Vaisala and Belfort Ceilometers to produce 
six different cloud masks of varying sensitivity. We analysed spectra collected 
exclusively during clear-sky conditions, as identified by the absence of clouds in all 
of the coincident ARSCL masks. 
Thermodynamic trends. Trends in the thermodynamic state at SGP and NSA are 
shown in Extended Data Fig. 5. The radiosonde-derived annually averaged 
least-squares trend in the clear-sky temperature profile at SGP showed an increase in 
spring in the lowest 2km of the atmosphere, but decreases elsewhere. Lower- 
atmosphere water concentrations showed trends of opposite sign depending on 
season and altitude. At NSA, annually averaged clear-sky temperature profile trends 
were positive in the lower troposphere except in spring. Lower-tropospheric humid- 
ity also increased at NSA, except in the autumn. It is important to note that these 
thermodynamic trends are based on clear-sky conditions only and are distinct from 
all-sky trends. 

Still, temperature trends can have a non-zero impact on spectral downwelling 
longwave radiation, which can be seen in Fig. 2e, where spectral trends at 
wavenumbers in the near-wings of the fundamental (ν2) CO2 band are negative. The 
negative trends arise from the nonlinear interactions between CO2 
and temperature trends, especially where there is a contrast in temperature trends 
between the boundary layer and the free troposphere under temperature inversion 
conditions, which are common at the NSA site. The effects of CO2 and 
temperature generally produce separable impacts on the spectral downwelling longwave 
radiation, especially in the weak vibration-rotation bands that dominate the observed 
CO2 surface radiative forcing. However, the effects interact nonlinearly near the 
centre of the fundamental (ν2) CO2 band. This nonlinearity complicates the 
separability of temperature and CO2 trends over this narrow spectral range. 
CarbonTracker-CH4 and MERRA data use. All radiative transfer calculations 
performed in this analysis used CarbonTracker-CH4 profile information. 
CarbonTracker-CH4 (ref. 21) provides eight daily profiles on a 40° latitude by 60° longitude global 
grid of component contributions of CH4 to the atmospheric concentration (a 
background component, a component due to coal and oil/gas production, a component 
due to animals, rice cultivation, and waste, a component due to wetlands, soil, oceans, 
and insects/wild animals, a component due to emissions from fires, and a 
component due to emissions from oceans). We sum those components and linearly 
interpolate them to the ARM SGP and NSA sites in space and time to coincide with 
the radiosonde observation. 

We also used NASA’s Modern-Era Retrospective Analysis for Research and 
Applications (MERRA) for O3 profiles for all radiative transfer calculations. MERRA 
provides eight daily O3 profiles at 42 pressure levels at 1.25° horizontal spacing and 
these were linearly interpolated to the ARM SGP and NSA sites in space and time to 
coincide with the radiosonde observation. 
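
The time part of this interpolation can be sketched as follows. The example assumes that eight daily profiles means one profile every three hours and handles time only (the spatial interpolation is omitted); the function name and the two-level CH4 values are hypothetical.

```python
def interpolate_profile_in_time(times_h, profiles, target_h):
    """Linearly interpolate, level by level, between the two bracketing times.

    times_h  : increasing analysis times (hours)
    profiles : one profile (list of values per level) for each analysis time
    target_h : radiosonde launch time (hours)
    """
    for i in range(len(times_h) - 1):
        t0, t1 = times_h[i], times_h[i + 1]
        if t0 <= target_h <= t1:
            w = (target_h - t0) / (t1 - t0)
            return [(1.0 - w) * a + w * b
                    for a, b in zip(profiles[i], profiles[i + 1])]
    raise ValueError("target time outside the range of the analysis times")


# Three consecutive 3-hourly CH4 profiles (ppb) on two levels; made-up values.
times_h = [0.0, 3.0, 6.0]
ch4_profiles = [[1800.0, 1790.0], [1806.0, 1793.0], [1812.0, 1796.0]]
ch4_at_launch = interpolate_profile_in_time(times_h, ch4_profiles, target_h=4.0)
```

A launch one hour after the 03:00 profile takes two-thirds of its weight from that profile and one-third from the 06:00 profile.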

©2015 Macmillan Publishers Limited. All rights reserved 

Radiative transfer. LBLRTM version 12.2 was used to simulate AERI spectra with 
and without CO2 increases. LBLRTM was run on the 200 levels that are linearly 
spaced in log(pressure) onto which the radiosonde temperature and humidity 
profiles have been interpolated. The conversion from spectral radiance to spectral 
flux was performed using spectrally dependent radiance-to-flux conversion factors 
determined through three-point quadrature flux calculations for each profile of 
atmospheric thermodynamic state corresponding to the AERI measurements 
following ref. 41. Histograms of the calculated distributions of the spectral conversion 
factor between radiance and flux are shown in Extended Data Fig. 6. These 
demonstrate that the conversion from radiance to flux depends on wavenumber, with 
stronger bands showing an Angular Distribution Model value very close to unity, 
indicating that the radiance is isotropic, and weaker bands having lower Angular 
Distribution Model values, indicating that the radiance is anisotropic. 
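
A three-point quadrature flux calculation can be illustrated as below. The sketch assumes Gauss-Legendre nodes mapped onto the cosine of the zenith angle, which may differ from the quadrature used in ref. 41, and the function name is hypothetical. It evaluates the downwelling flux F = 2π ∫ I(μ) μ dμ over μ from 0 to 1.

```python
import math

# Three-point Gauss-Legendre nodes and weights on the interval [-1, 1].
_NODES = (-math.sqrt(3.0 / 5.0), 0.0, math.sqrt(3.0 / 5.0))
_WEIGHTS = (5.0 / 9.0, 8.0 / 9.0, 5.0 / 9.0)


def hemispheric_flux(radiance_of_mu):
    """Flux F = 2*pi * integral_0^1 I(mu) * mu dmu, with mu = cos(zenith angle)."""
    total = 0.0
    for x, w in zip(_NODES, _WEIGHTS):
        mu = 0.5 * (x + 1.0)               # map the node from [-1, 1] to [0, 1]
        total += 0.5 * w * radiance_of_mu(mu) * mu
    return 2.0 * math.pi * total


# Isotropic radiance (an opaque band): the flux is pi times the radiance.
isotropic_flux = hemispheric_flux(lambda mu: 1.0)

# Radiance growing as 1/mu towards the horizon (an optically thin band).
thin_flux = hemispheric_flux(lambda mu: 1.0 / mu)
```

For isotropic radiance the ratio of flux to zenith radiance is π; for the idealized optically thin case it is 2π. This shows why the conversion has to be evaluated band by band rather than applied as a single constant.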

To determine the contribution of CO2 to surface forcing, we removed the signal 
associated with the other radiatively active constituents. To do this, we performed 
a radiative-transfer calculation using LBLRTM with observed temperature and 
water-vapour profiles from radiosondes (with water vapour scaled by microwave 
radiometer column water retrievals) and with a fixed CO2 concentration set to 
370 ppm (the CO2 atmospheric concentration at the beginning of the analysis 
period) for cloud-free conditions when the AERI instrument produced calibrated 
spectra. We differenced these calculations against the actual AERI spectral 
measurements to produce a time-series of residual spectra between 2000 and the end of 
2010. Since the fixed CO2 concentration is the primary difference between the inputs 
to each calculation and the actual time-varying atmospheric states, the residual 
spectra contain the signal of changing CO2. We integrated the spectra over 
frequency (wavenumber) and converted radiance to flux, based on the quadrature 
calculations described above, to produce a forcing value at each time step. 
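
The last step, turning a residual spectrum into one broadband forcing value, can be sketched as a weighted integral over wavenumber. The example assumes that the spectrally dependent conversion factor is applied channel by channel before integrating and that it carries units of steradians; the function name and the five-channel toy spectrum are hypothetical.

```python
import math


def broadband_forcing(wavenumbers, residual_radiance, flux_factor):
    """Integrate a residual spectrum over wavenumber and return W m^-2.

    wavenumbers       : channel centres (cm^-1), increasing
    residual_radiance : obs - calc, in mW m^-2 sr^-1 per cm^-1
    flux_factor       : radiance-to-flux factor for each channel (sr)
    """
    spectral_flux = [r * f for r, f in zip(residual_radiance, flux_factor)]
    total_mw = 0.0
    for i in range(len(wavenumbers) - 1):   # trapezoid rule over wavenumber
        dnu = wavenumbers[i + 1] - wavenumbers[i]
        total_mw += 0.5 * (spectral_flux[i] + spectral_flux[i + 1]) * dnu
    return total_mw / 1000.0                # mW m^-2 -> W m^-2


# Toy residual in a narrow band, with an isotropic factor of pi in each channel.
wavenumbers = [700.0, 710.0, 720.0, 730.0, 740.0]
residual = [0.0, 1.0, 2.0, 1.0, 0.0]
factors = [math.pi] * 5
forcing = broadband_forcing(wavenumbers, residual, factors)
```

The toy residual integrates to 40 mW m⁻² sr⁻¹, which the factor of π converts to roughly 0.13 W m⁻².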

We used the Rapid Radiative Transfer Model to perform the calculations of 
fair-weather bias, as described below. It calculates lower radiative forcing values 
than line-by-line models for the same atmospheric state. 

Error budget. The error budget for each surface-forcing value contains 
contributions from several sources, including radiometric noise, instrument spectral and 
radiometric calibration, and residual removal. Instrument radiometric calibration 
is achieved through a suite of diagnostics, including detector nonlinearity 
characterization, electronics calibration, field-of-view testing, and routine views of the 
instrument blackbodies. The achieved radiometric uncertainty is better than 1% 
(3σ) of ambient radiance for a single observation and decreases with the number of 
observations. The instantaneous noise-effective spectral radiance specification for 
the AERI instrument is <0.2 mW m⁻² sr⁻¹ per cm⁻¹ for 670-1,400 cm⁻¹, for the 
rms of a 2-min ambient blackbody view. Spectral calibration is achieved through 
routine comparisons with stable atmospheric lines and has an uncertainty of better 
than 0.08 cm⁻¹ (corresponding to 5 ppm of the laser wavenumber) (3σ). Residual 
removal uncertainty consists of two subcomponents: (1) uncertainty in the 
atmospheric structure as provided by the BBSS and (2) uncertainty in the spectroscopy 
that informs LBLRTM. The former is derived from the radiosonde uncertainty 
propagated in spectral space with LBLRTM; the latter is derived from the HITRAN 
database uncertainty codes. The effects of spectroscopic uncertainty are derived 
from forcing estimates with LBLRTM calculations where line parameters are 
modified according to stated HITRAN error codes, which report uncertainties 
in intensity, half-width, temperature dependence, and pressure shift for each line. 
Trend determination. Both spectral trends, as shown in Fig. 2, and broadband 
trends, as shown in Fig. 4, are determined from a least-squares regression using 
Matlab’s “polyfit” function (http://www.mathworks.com/help/matlab/ref/polyfit.html) 
of the estimate values based on clear-sky conditions only. Uncertainty in the 
trend was calculated from the formulae in Weatherhead et al. The trend in 
seasonal amplitude was determined from a least-squares regression from the peaks of 
the monthly averaged time-series of the forcing. 
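
A degree-one least-squares fit of this kind can be reproduced without any special library. The sketch below is written in Python rather than Matlab and fits a synthetic monthly series; the function name `linear_trend` and the series are hypothetical, and the trend uncertainty from Weatherhead et al. is not reproduced.

```python
import math


def linear_trend(times, values):
    """Ordinary least-squares slope and intercept (a degree-1 polynomial fit)."""
    n = len(times)
    t_mean = sum(times) / n
    v_mean = sum(values) / n
    sxx = sum((t - t_mean) ** 2 for t in times)
    sxy = sum((t - t_mean) * (v - v_mean) for t, v in zip(times, values))
    slope = sxy / sxx
    return slope, v_mean - slope * t_mean


# Synthetic monthly series: 0.02 W m^-2 per year trend plus an annual cycle.
times = [2000.0 + m / 12.0 for m in range(132)]
values = [0.02 * (t - 2000.0) + 0.08 * math.sin(2.0 * math.pi * t) for t in times]
slope_per_year, intercept = linear_trend(times, values)
trend_per_decade = 10.0 * slope_per_year
```

The fitted trend is close to, but not exactly, the 0.2 W m⁻² per decade built into the synthetic series, because the annual cycle contributes slightly to the fit over a finite record.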

Fair-weather bias. One key question associated with the methods concerns the 
potential for fair-weather bias in this approach due to the screening of data based 
on the ARSCL cloud masks. A bias could occur if there is a relationship between 
surface forcing and cloud cover that is distinct from, and would alter the trends 
in, what was found under clear-sky conditions. To address this question, we have 
undertaken a set of calculations that follow the BBHRP” calculations based on the 
Radiatively Important Parameters Best Estimate (RIPBE)** Value Added Products 
(see http://www.arm.gov/data/eval/24 for details). This product contains broadband
profiles of fluxes and heating rates throughout the atmosphere. We have
recalculated BBHRP profiles at SGP for 2010 based on fixed CO₂ concentrations
and compared those to the original BBHRP profiles (that contained time-varying 
CO₂) and subset the data that were identified as clear-sky in RIPBE (through the
ARSCL flags) in order to estimate this bias. The results are shown in Extended Data 
Fig. 7 and indicate that the all-sky surface forcing is 0.05 W m⁻² less than the
clear-sky forcing, which represents a 25% difference in the two quantities. This finding is
expected because there is non-negligible overlap between cloud absorption, which
is broadband, and CO₂ absorption features, so clouds mask the forcing from CO₂.
We note that the difference in clear-sky and all-sky surface radiative forcing values 
is not statistically significant and that the extent of the fair-weather bias depends on 
the occurrence frequency of cloudy conditions, which varies year-to-year at the
SGP and NSA sites. 

Surface versus TOA forcing. In general, several studies have found that there is a 
complicated relationship between surface and top-of-atmosphere (TOA) forcing, 
depending on the atmospheric thermodynamic and latent heating structure, with 
the former being less than the latter'*“*. This is further underscored by other findings 
for an increase in CO₂ from 287 to 369 parts per million by volume (ppmv), which
show a forcing at the surface of 0.57 W m⁻² and a forcing at 200 hPa of 1.92 W m⁻²
from line-by-line models*.

To specifically address the relationship between forcing at the surface and TOA 
at the ARM sites, we have also undertaken a set of calculations that use the BBHRP 
infrastructure described above to demonstrate the relationship between surface 
and TOA flux changes. Results are shown in Extended Data Fig. 8, and indicate 
that magnitudes of the flux changes at the surface and TOA are correlated, although 
the TOA flux changes are larger because the downwelling surface radiation of the 
former quantity is distributed into sensible and latent heat. We note that the flux 
change at the surface and TOA is calculated by differencing fluxes with time-varying 
CO₂ minus fluxes with fixed CO₂ at each altitude, and the decreased emission to
space and increased emission to the surface leads to flux changes of opposite sign 
for TOA and surface flux changes, respectively. 

Vertical sensitivity. Using LBLRTM, we calculated the vertical sensitivity of 
broadband longwave surface forcing due to perturbations in the atmospheric state 
profile. These calculations were based on model atmospheres, including the US 
standard, the tropical, the mid-latitude summer, the mid-latitude winter, the sub- 
Arctic summer, and the sub-Arctic winter atmospheres, which span a broad range 
of atmospheric thermodynamic states**. We separately perturbed the temperature 
by 1 K, water vapour by 1%, CO₂ by 10 ppm, O₃ by 10%, and CH₄ by 1 ppm for a
1-km-thick layer of the atmosphere from the surface to 20 km. Results are shown
in Extended Data Fig. 9 and indicate that, with the exception of O₃, contributions
to surface forcing are dominated by the bottom 5 km of the atmosphere, regardless
of the thermodynamic state. O₃ perturbations have a more complicated vertical
structure because O₃ is partially transparent in the troposphere and its concentration
peaks in the stratosphere, meaning that upper-level O₃ concentration perturbations
at the 10% level can have a modest impact on the surface energy balance.

Extended Data Figure 1 | Schematic. Schematic of the derivation of surface forcing from AERI observations and calculations based on the atmospheric structure.

Extended Data Figure 2 | AERI instrument stability. Time series of the
AERI-instrument-derived laser wavenumber around a nominal frequency
of 15,799 cm⁻¹.

Extended Data Figure 3 | CarbonTracker profiles. a, CT2011 profile time series of CO₂ at the SGP site. b, CT2011 fossil fuel component of the CO₂ profile.
c, CT2011 biomass burning component of the CO₂ profile. PGS, the ARM Precision Gas System Carbon Dioxide Mixing Ratio System.

investigation at the ARM SGP site. Each count corresponds to the scaling
between a collocated radiosonde and microwave radiometer retrieval.

Extended Data Figure 5 | Thermodynamic trends. a, Annual and seasonal
clear-sky temperature (T) profile trends derived from radiosondes and
ARSCL data for cloud-clearing at SGP from 2000 to 2010. b, Same as a but for
water vapour (H₂O) profile trends. c, As for a but temperature profile trends at
NSA. d, As for b but for water vapour profile trends (in grams of water
vapour per kilogram of air per decade) at NSA.

Extended Data Figure 6 | Conversion from radiance to flux. Histogram of
zenith radiance to flux spectral conversion for AERI channel 1 spectral
channels based on LBLRTM calculations based on the thermodynamic profiles
from the ARM SGP site from 2000 to 2010. b, As for a but for the NSA site.
ADM, Angular Distribution Model.

Extended Data Figure 7 | Fair-weather bias. a, Histogram of the difference in flux between BBHRP calculations with time-varying CO₂ and calculations where
CO₂ = 370 ppmv for all profiles at 30-min resolution during 2010 at SGP. b, As for a but for the subset of data identified by the ARSCL as clear-sky.

Extended Data Figure 8 | TOA and surface fluxes. a, Occurrence frequency
(in per cent) plot of tropopause versus surface forcing based on BBHRP
calculations with time-varying CO₂ and where CO₂ = 370 ppmv for all profiles
at 30-min resolution during 2010 at SGP for all-sky conditions as identified by
ARSCL flags. b, As for a but for clear-sky conditions.

Extended Data Figure 9 | Surface flux sensitivity to atmospheric profiles.
a, The sensitivity of the surface radiative flux (F_surf) to the level of a 1 K
perturbation in temperature for different model atmospheres including
tropical, US standard (USSTD), mid-latitude summer (MLS), mid-latitude
winter (MLW), sub-Arctic summer (SAS), and sub-Arctic winter (SAW)**.
b, As for a but level perturbations are given as percentage H₂O. c, As for
a but level perturbations are 10 ppm CO₂. d, As for a but level perturbations are
10% O₃. e, As for a but level perturbations are 1 ppm CH₄.

Long-term decline of the Amazon carbon sink


A list of authors and their affiliations appears at the end of the paper 


Atmospheric carbon dioxide records indicate that the land surface 
has acted as a strong global carbon sink over recent decades”, with 
a substantial fraction of this sink probably located in the tropics’, 
particularly in the Amazon‘. Nevertheless, it is unclear how the ter- 
restrial carbon sink will evolve as climate and atmospheric compo- 
sition continue to change. Here we analyse the historical evolution 
of the biomass dynamics of the Amazon rainforest over three dec- 
ades using a distributed network of 321 plots. While this analysis 
confirms that Amazon forests have acted as a long-term net biomass 
sink, we find a long-term decreasing trend of carbon accumulation. 
Rates of net increase in above-ground biomass declined by one-third 
during the past decade compared to the 1990s. This is a consequence 
of growth rate increases levelling off recently, while biomass mor- 
tality persistently increased throughout, leading to a shortening of 
carbon residence times. Potential drivers for the mortality increase 
include greater climate variability, and feedbacks of faster growth on 
mortality, resulting in shortened tree longevity’. The observed de- 
cline of the Amazon sink diverges markedly from the recent increase 
in terrestrial carbon uptake at the global scale’, and is contrary to 
expectations based on models*®. 


The response of the Earth’s land surface to increasing levels of 
atmospheric CO₂ and a changing climate provides important feedbacks
on future greenhouse warming®”. One of the largest ecosystem carbon 
pools on Earth is the Amazon forest, storing around 150-200 Pg C in 
living biomass and soils*. Earlier studies based on forest inventories 
in the Amazon Basin showed the tropical forest here to be acting as 
a strong carbon sink with an estimated annual uptake of 0.42-0.65 Pg C yr⁻¹
for 1990-2007, around 25% of the residual terrestrial
carbon sink**. There is, however, substantial uncertainty as to how the 
Amazon forest will respond to future climatic and atmospheric com- 
position changes. Some earlier modelling studies predicted a large-scale 
dieback of the Amazon rainforest’, while more recent studies predict 
a carbon sink well into the twenty-first century due to a CO₂ fertilization
effect®. The realism of such model predictions remains low owing
to uncertainty associated with future climate and vegetation responses®” 
in particular changes in forest dynamics”’®"’. Thus, direct observations 
of tropical tree responses are crucial to examine what changes are actu- 
ally occurring and what to expect in the future. Here we analyse the 
longest and largest spatially distributed time series of forest dynamics 
for tropical South America. 


Figure 1 | Trends in net above-ground biomass change, productivity and mortality
across all sites. a-c, Black lines show the overall mean change up to 2011 for
321 plots (or 274 units) weighted by plot size, and its bootstrapped confidence
interval (shaded area). The red lines indicate the best model fit for the
long-term trends since 1983 using general additive mixed models (GAMM),
accounting explicitly for differences in dynamics between plots (red lines
denote overall mean, broken lines denote s.e.m.). Alternative analyses of
subsets of plots that were all continuously monitored throughout shorter time
intervals confirm that the observed trends are not driven by temporal changes
in individual sample plot contributions (Extended Data Fig. 3). Estimated
long-term (linear) mean slopes and significance levels are indicated, and are
robust with regard to the statistical approach applied (that is, parametric or
non-parametric, see Methods). Shading corresponds to the number of plots that
are included in the calculation of the mean, varying from 25 plots in 1983
(light grey) to a maximum of 204 plots in 2003 (dark grey). The uncertainty and
variation is greater in the early part of the record owing to relatively low
sample size (see Extended Data Fig. 4).

Our analysis is based on 321 inventory plots lacking signs of recent
anthropogenic impacts from the RAINFOR network’ and published plots. 
The sites are distributed throughout the Amazon basin and cover all 
major forest types, soils and climates (Extended Data Fig. 1). For each 
plot (mean size 1.2 ha) all trees with stem diameter greater than 100 mm 
were identified, and allometric equations applied to convert tree dia- 
meter, height and wood density to woody biomass or carbon®. Net bio- 
mass change was estimated for each census interval as the difference 
between standing biomass at the end and the beginning of the interval 
divided by the census length. We also derived forest woody productivity 
(hereafter termed productivity) from the sum of biomass growth of surviving
trees and trees that recruited (that is, reached a diameter ≥ 100 mm),
and mortality from the biomass of trees that died between censuses, 
allowing for census-interval effects (see Methods). Plots were measured 
on average five times and the mean measurement period was 3 years. 
For analysis purposes small plots were aggregated to leave 274 distinct 
units. We report trends since 1983, the first year with measurements 
for 25 plots, up to mid-2011. 
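
As a rough sketch of the per-interval quantities defined above, the following hypothetical function computes productivity, mortality and net change for one census interval. It deliberately omits the census-interval corrections described in the Methods, and all argument names and example numbers are assumptions made for illustration.

```python
def interval_rates(agb_start, growth_survivors, agb_recruits, agb_dead, years):
    """Per-interval biomass rates in Mg ha^-1 yr^-1 (no census-interval correction).

    agb_start        standing above-ground biomass at the first census (Mg ha^-1)
    growth_survivors biomass gained by trees alive at both censuses
    agb_recruits     biomass of trees that crossed the diameter threshold
    agb_dead         biomass of trees that died between the censuses
    years            census interval length in years
    """
    if years <= 0:
        raise ValueError("census interval must have positive length")
    productivity = (growth_survivors + agb_recruits) / years
    mortality = agb_dead / years
    agb_end = agb_start + growth_survivors + agb_recruits - agb_dead
    net_change = (agb_end - agb_start) / years
    return productivity, mortality, net_change

# Hypothetical plot: 300 Mg ha^-1 at the first census, measured again 3 years later.
prod, mort, net = interval_rates(300.0, 12.0, 3.0, 9.0, 3.0)
print(prod, mort, net)
```

By construction the net change equals productivity minus mortality, which is why a decline in net change can arise from rising mortality even while productivity increases.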

Our data show that mature forests continued to act as a biomass sink 
from 1983 to 2011.5, but also reveal a long-term decline in the net rate 
of biomass increase throughout the census period (Fig. 1a). The decline
in net biomass change is due to a strong long-term increase in mortality
rates (Fig. 1c), and occurred despite a long-term increase in productiv- 
ity (Fig. 1b). While mortality increased throughout the period, produc- 
tivity increases have recently stalled showing no significant trend since 
2000 (Extended Data Fig. 3). These time trends are based on a varying 
set of plots over time (Extended Data Fig. 4), but this site-switching 
does not alter the results (see Methods). The observed trends also emerge 
from a separate plot-by-plot analysis (Fig. 2), with increases in mortality 
exceeding productivity gains by approximately two to one. Trends are 
rarely significant at the individual plot level owing to the stochastic 
nature of local forest dynamics, but the mean slopes of net change, 
productivity and mortality all differ significantly from zero. Changes 
in forest dynamics were not geographically limited to a particular area, 
but occurred throughout the lowland South American tropics (Fig. 2). 
While rates of change vary depending on the precise plot set, time window 
and analytical approach used, the trends remain robust (Figs 1, 2 and 
Extended Data Fig. 3). 

Artefactual explanations have previously been offered to explain trends 
in biomass dynamics from plot measurements’*”*. Principally, it has 
been suggested that reported net biomass increases* could be driven by 
recovery of forests from local disturbances'*. However, contrary to 
Figure 2 | Annual change in net above-ground biomass change,
productivity and mortality for individual sites. The lines in the left-hand 
panels show the long-term rate of change for 117 plots (or 87 units), estimated 
using linear regressions weighted by census-interval length and for display 
purposes centred around zero. This analysis includes only plots that were 
monitored for at least 10 years and contained three or more census intervals 
with at least one in the 1990s and one in 2000s. Red lines indicate long-term 
trends that negatively affect biomass stocks (for example, decreasing net
change, increasing losses) and green lines indicate trends that positively
affect biomass stocks (for example, increasing productivity). Bold black lines 
indicate the mean slope across all plots and confidence intervals (2.5-97.5 
percentiles). Insets in the left panels show the frequency distribution of the 
slopes, with the mean slope and P value for t-test of difference from no slope. 
The maps show the location of the sites, and the colour and arrow length 
indicate the sign and magnitude of the slope, with adjacent plots joined into a 
single site for display purposes. 
observations from recovering neotropical forests'* and successional
studies’®, the plots have collectively experienced increased biomass growth 
(Fig. 1), accelerated stem recruitment and death (Extended Data Fig. 6), 
and net biomass change is positively related to changes in stem num- 
bers, but not in wood density (Fig. 3b, c). It is thus unlikely that the 
overall patterns would be driven by recovery from disturbances. Alter- 
natively, increases in mortality have been proposed to arise due to biased 
selection of plots in mature forest patches, which over time accumulate 
disturbances and so decline in biomass’’. The fact that forests and trees 
have continued to get bigger (Extended Data Fig. 5a) is contrary to this 
explanation. In addition, if this were driving the network-wide pattern, 
then the observed trends should disappear if data are reanalysed using 
only the first interval of each plot, but instead they persist. In summary, 
the data suggest that trends are unlikely to be caused by artefactual 
explanations of forests recovering from disturbances or selection of 
mature forest patches (see Supplementary Information for a more 
complete exploration of these potential biases). 

The factors driving the observed long-term changes remain unclear. 
The levelling off of productivity in the most recent decade (Fig. 1b and 
Extended Data Fig. 3f) could be due either to a relaxation of the growth 
stimulus itself, or to the onset of a counteracting factor depressing growth 
rates. The recent demonstration of Amazon-wide carbon sink suppres- 
sion during a drought year’® indicates one possible driver. Tropical drought 
is also often associated with higher temperatures, which may further 
contribute to reducing productivity’ and carbon uptake’*. The past 
decade in Amazonia has seen several droughts”? and warming”, which 
coincide closely with the stalling productivity across Amazon forests. 

The increased rate of biomass mortality is driven by an increasing 
number of trees dying per year (Extended Data Fig. 6c) rather than an 
increase in the size of the dying trees (Extended Data Fig. 5c). Several 
mechanisms may explain this increase in loss of biomass due to tree 
mortality, with recent climate events being an obvious candidate. The 
plot data clearly show short-term peaks in the size of dying trees during 
the anomalously dry years 2005 and 2010 (Extended Data Fig. 5c). These 
are consistent with results from rainfall exclusion experiments in Ama- 
zonia’'”’ and observations* showing that large tropical trees are vulner- 
able to drought stress. However, our data lack the signature expected if 
drought were the dominant long-term driver of the increasing loss of 
biomass due to mortality in Amazonia. That is, there has been no long- 
term change in the size of dead trees (Extended Data Fig. 5c), living 
trees have continued to get bigger (Extended Data Fig. 5a), and the in- 
crease in stem mortality predates the drought of 2005 (Extended Data 
Fig. 6c). 


Figure 3 panel annotations: mean = 0.06, n = 234 plots, R² = 0.868, P < 0.001; mean = -0.4, n = 234 plots, R² = 0.088, P < 0.001.


Alternatively, the increased productivity may have accelerated tree 
life cycles so that they now die younger. Large stature is associated with 
size-related hydraulic” and mechanical failure, reproductive costs” 
and photosynthetic decline’’. Faster growth exposes trees to these size- 
related risks earlier, as evidenced by tree ring data suggesting that faster 
growth shortens lifespans**”’, and by experimental data showing early 
onset of reproduction under increased CO₂ (ref. 28). The observed long-term
acceleration in stem mortality rates and the plot-level association
between productivity and the strength of the increase in biomass loss 
due to mortality (Extended Data Fig. 8b) are consistent with such a 
mechanism. While demographic feedbacks are not explicitly included 
in dynamic global vegetation models”, our results suggest that they could 
in fact influence the capacity of forests to gain biomass”, with transient 
rates of ecosystem net carbon accumulation highly sensitive to even small 
changes in carbon turnover times’®. 

Finally, we put our results in a global perspective. According to glo- 
bal records, the land carbon sink has increased since the mid-1990s 
(refs 1, 2). While tropical land contributed significantly to this global 
sink during the 1980s and 1990s, our results show that the total net car- 
bon sink into intact Amazon live biomass then decreased by 30% from 
0.54 Pg C yr⁻¹ (confidence interval 0.45-0.63) in the 1990s to 0.38 Pg C yr⁻¹
(0.28-0.49) in the 2000s (see Methods). If our findings for the Amazon 
are representative for other tropical forests, and if below-ground pools 
have responded in the same way as above-ground biomass (AGB), then 
an apparent divergence emerges between a strengthening global terres- 
trial sink on one hand’ and a weakening tropical sink on the other. 
However, from an atmospheric perspective we also note that some of 
the effects of the Amazon changes are yet to be observed, as little of the 
carbon resulting from increased mortality is immediately released into 
the atmosphere’. Instead, dead trees decay slowly, with a fraction also 
moving into a long-term soil carbon pool. The Amazon forest sink has 
therefore become increasingly skewed towards gains in the necromass 
pools, inducing a substantial lag in the probable atmospheric response. 
On the basis of the observed long-term increase in mortality rates, we 
estimate that the atmosphere has yet to see ~3.8 Pg of the Amazon 
necromass carbon produced since 1983 (see Methods), representing a 
30% increase in necromass stocks. The modelled increase in Amazon 
necromass is twice the magnitude of the cumulative decadal decline in 
the live biomass sink from the 1990s to the 2000s (from 5.4 to 3.8 Pg C). 

In summary, we find that the Amazon biomass carbon sink has started 
to decline, due to recent levelling of productivity increases, combined 
with a sustained long-term increase in tree mortality. This behaviour is 
at odds with expectations from models of a continually strong tropical 
Figure 3 | Relationships between annual net change in biomass of
individual plots and their annual change in basal area, stem numbers and
wood density. a-c, The mean values of the rates of changes for basal area
(a), stem numbers per hectare (b) and wood density (c) are given in each panel
along with the R² of the relationship with annual net biomass change and the
P value of the linear relationship. The number of plots included is 234 (that is,
those with data on change in basal area, stem numbers and wood density).
biomass sink®, and underlines how difficult it remains to predict the role
of land-vegetation feedbacks in modulating global climate change”’°. 
Investment in consistent, coordinated long-term monitoring on the 
ground is fundamental to determine the trajectory of the planet’s most 
productive and diverse biome. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS


Forest biometric data. Mature forests were sampled throughout the forested 
lowland tropical areas of South America (below 1,500 m above sea level) that receive 
at least 1,000 mm of rainfall annually. To be included in this study, permanent 
sample plots were required to have two or more censuses. Immature or open forests, 
and those known to have had anthropogenic disturbances owing to fire or selective 
logging, were excluded. The plots are geographically well dispersed throughout the 
Amazon Basin (Extended Data Fig. 1), covering every tropical South American 
country except Suriname. Supplementary Table 1 includes a complete list of plots 
included in this study with the respective size, start and end date for the censuses 
included in this analysis, and names of the main researchers for each plot. A full 
manual for plot establishment and tree measurements of the RAINFOR plot 
network can be found in ref. 31. 

Of the total 321 plots, 232 are from the RAINFOR network. In addition, we com- 
piled biomass dynamics data for 89 plots from published studies, mostly from one 
site for 2001 to 2003 (DUK) (see Supplementary Table 1). For these plots, we sim- 
ply used the available biomass data as published. Note that these studies do not apply 
the same allometric equations, and may have slightly different measurements pro- 
tocols and census interval corrections. While as a general rule all trees with stem 
diameters greater than 100 mm were included in this analysis, palms (Arecaceae) 
or coarse herbs of the genus Phenakospermum were excluded for a few plots (19) 
due to changes in measurement protocols over time in these plots. In addition, for 
a few plots only trees ≥130 or ≥200 mm in diameter were recorded in the first
census(es). In these cases, we either standardized the biomass data in the first
census(es) to trees ≥100 mm using the ratio of biomass for trees ≥100 and ≥200 mm
(seven plots) of later censuses, or we used the slightly different minimum size
threshold for the full period, including only trees ≥130 mm (for two plots). For full details
on these specific issues see the online source data. For analysis purposes, plots smaller 
than 0.5 ha that were within 1 km or less of one another were merged, to give a total 
of 274 ‘sample units’. The mean size across all sample units was 1.24 ha, and the
mean total monitoring period was 11.1 years. In total, the study monitored 343 ha 
for a combined total of 4,620 ha years, involving more than 850,000 tree measure- 
ments on around 189,000 individual trees larger than 10 cm diameter. 

The standard protocol for tree measurements in the field is to measure diameter 
at breast height, defined as 1.3 m from the base of the stem. For non-cylindrical 
stems owing to buttresses or other deformities the point of measurement is raised 
approximately 50 cm above the deformity. The exact height of the point of mea- 
surement (POM) was recorded and marked on the trees to ensure that future mea- 
surements were taken at the same point. For those trees where buttress growth 
threatened to reach the initial POM, we raised the height of diameter measurement 
to a new POM, located sufficiently high above the buttresses to avoid interference 
of buttresses with diameter measurements at subsequent censuses. If a change in 
POM was made, we recorded both the diameter at the original POM and the new 
POM, thus creating two disjoint series of diameters measured at different heights. 
To avoid potential biases that can result from not accounting for the POM move- 
ment, following ref. 32 we computed a new diameter series that was calculated as 
the mean of: (1) diameter measurements standardized to the new (final) POM, ob- 
tained by multiplication of measurements at the original POM by the ratio between 
diameter measurements at the new and original POM, and (2) diameter measure- 
ments standardized to the original POM, by multiplying measurements at the new 
POM by the ratio between diameter measurements at the original and new POM. 
The outcome of our analysis was robust with respect to the method of dealing with 
POM changes, giving similar results using several alternative approaches for dealing 
with POM changes including the technique described previously’’ in which diameter 
gains at the new POM are added to the diameter at the original POM. Following 
ref. 32 we used several techniques to avoid or minimise potential errors arising from 
missing diameter values, typographical errors, or extreme diameter growth ≥4 cm yr⁻¹
or total diameter growth ≤ −0.5 cm across a single census interval (that is, losing
0.5 cm, as trees may shrink by a small amount due to hydrostatic effects in times of 
drought, and measurement errors can be both positive and negative). For stems be- 
longing to species known to experience very high growth rates or noted as having 
damaged stems we accepted these values. We used interpolation, where possible, 
else extrapolation to correct errors. If neither of these procedures were possible we 
used the mean growth rate of all dicotyledonous stems in the same plot census,
belonging to the same size class, with size classes defined as 10 ≤ diameter < 20 cm,
20 ≤ diameter < 40 cm, and diameter ≥ 40 cm, to estimate the missing diameter
value. Of all stem growth increments, for 1.7% per census we assigned interpolated 
estimates of diameter, for 0.9% we used extrapolated estimates, and for 1.5% we 
used mean growth rates. 
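
The averaging of the two POM-standardized diameter series described above can be sketched as below. This is one possible reading of the procedure, not the authors' implementation: the function name, the argument layout (measurements before the change, measurements from the change census onward, and the paired original-POM measurement at the change census) and the example diameters are all assumptions.

```python
import numpy as np

def harmonise_pom(d_before, d_after, d_orig_at_change):
    """Average of the two POM-standardized diameter series.

    d_before          diameters at the original POM, censuses before the change
    d_after           diameters at the new POM, from the change census onward
                      (d_after[0] is the new-POM reading at the change census)
    d_orig_at_change  original-POM reading taken at the change census
    """
    d_before = np.asarray(d_before, dtype=float)
    d_after = np.asarray(d_after, dtype=float)
    ratio = d_after[0] / d_orig_at_change        # new POM / original POM
    # (1) everything expressed at the new (final) POM
    to_new = np.concatenate([d_before * ratio, d_after])
    # (2) everything expressed at the original POM
    to_orig = np.concatenate([d_before, d_after / ratio])
    return 0.5 * (to_new + to_orig)

# Hypothetical tree (mm): POM raised at the third census, where the stem
# measured 420 mm at the original POM and 378 mm at the new one.
series = harmonise_pom([400.0, 410.0], [378.0, 390.0], 420.0)
print(series)
```

Averaging the two series avoids privileging either measurement height, at the cost of producing diameters that correspond to neither physical POM.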

Computing above ground biomass, sampling effects and scaling up sink esti- 
mates. We converted diameter measurements to AGB estimates using allometric 
equations described previously*, which include terms for wood density, diameter 
and tree height. Tree height was estimated based on established diameter-height 
relations that vary between the different regions of Amazonia®. Wood density values
were extracted from a global wood density database (http://datadryad.org/handle/ 
10255/dryad.235; ref. 33). In cases where a stem was unidentified or where no 
taxon-specific wood density data were available, we applied the appropriate genus 
or family-specific wood density values. If none of those was available, the mean 
wood density of all identified dicotyledonous tree stems in the plot was applied. In
our analysis 80% of the trees were identified to species level, 94% to genus level, and 
97% to family level. All data on tree diameter, taxonomy, and associated botanical 
vouchers are curated under the https://www.forestplots.net/ web application and 
database**. 

The magnitude of the biomass sink for the forested area of the Amazon Basin for
the 1990s and 2000s was estimated by multiplying the magnitude of total biomass
change by an estimated area of intact forest, including all open and closed,
evergreen and deciduous forests for tropical South America (6.29 × 10⁸ ha, according
to Global Land Cover map 2000; ref. 35). For this calculation we also included bio- 
mass components that were not directly measured, assuming that these pools re- 
sponded proportionally to the measured above ground biomass in trees bigger 
than 10 cm in diameter. It has been shown using destructive measurements of stand 
biomass in central Amazonia that lianas and trees smaller than 100 mm in diameter 
represent an additional fraction of ~9.9% of the measured AGB (in trees ≥10 cm in
diameter**), and below ground biomass a fraction of ~37% of the AGB**. We assumed
that 50% of biomass is carbon”’. 
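
The scaling arithmetic in this paragraph can be sketched as follows. This is an illustrative reading rather than the authors' calculation: it assumes the forest area is 6.29 × 10⁸ ha, that the ~9.9% and ~37% fractions are both applied additively to the measured AGB change, and the per-hectare input value is hypothetical.

```python
FOREST_AREA_HA = 6.29e8            # intact forest area (assumed reading of the text)
SMALL_STEM_LIANA_FRACTION = 0.099  # lianas and trees < 100 mm, relative to measured AGB
BELOW_GROUND_FRACTION = 0.37       # below-ground biomass, relative to measured AGB
CARBON_FRACTION = 0.5              # carbon content of biomass
MG_PER_PG = 1e9                    # 1 Pg = 1e15 g = 1e9 Mg

def basin_sink_pg_c_per_yr(agb_change_mg_ha_yr):
    """Scale a measured AGB change (Mg ha^-1 yr^-1) to a basin-wide sink (Pg C yr^-1)."""
    expansion = 1.0 + SMALL_STEM_LIANA_FRACTION + BELOW_GROUND_FRACTION
    total_biomass_change = agb_change_mg_ha_yr * expansion      # Mg ha^-1 yr^-1
    carbon_mg_per_yr = total_biomass_change * CARBON_FRACTION * FOREST_AREA_HA
    return carbon_mg_per_yr / MG_PER_PG

# Hypothetical input: a measured net AGB gain of 1.0 Mg ha^-1 yr^-1.
print(round(basin_sink_pg_c_per_yr(1.0), 3))  # -> 0.462
```

Because the scaling is linear, any relative change in the plot-level rate carries through unchanged to the basin-wide estimate.
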
Analysing time trends and statistical analysis. The longer a census interval, the 
greater the proportion of growth that cannot be directly observed within the inter- 
val, due to the growth of initially recorded trees that subsequently die during the 
interval, and the growth of unrecorded trees that both recruit and die during the 
interval**“°. Hence, variation in census interval lengths in plots over time will affect 
estimates of woody productivity and mortality rates*’, potentially biasing the long- 
term trends if not accounted for. Using established procedures”, we therefore ex- 
plicitly corrected for the influence of varying census interval length, by estimating 
the following two unobserved components: (1) unobserved recruits, that is, the 
cohort of recruits that both enter and die between two successive censuses, and (2) 
unobserved biomass growth and mortality, due to the growth of trees after the final 
census that a tree was recorded alive. To correct for unobserved recruits, we first
estimated the number of unobserved recruits (U_r) as the number of stems in the plot
(N) multiplied by the annual recruitment rate (R) multiplied by the mean annual
mortality rate (M) multiplied by the census interval length (t): U_r = N × R × M × t.
We assumed that the diameter of these trees was 100 mm plus growth for one-third
of the interval using the median growth rate for trees in the 100-200 mm size class.
The biomass of each tree was estimated by applying the regionally appropriate 
allometric equation’, using the plot mean wood density. To correct for unobserved 
growth and mortality due to trees dying within an interval, we assumed that all
trees that died during the interval died at the mid-point, and assigned growth
up to this mid-point, estimated as the median growth of all trees in the plot within 
the same size class. Full details of the procedure have been described previously”. 
These estimates of the unobserved biomass dynamics usually accounted for only a 
small proportion of the total woody productivity and mortality (respectively 2.28% 
and 2.74%, on average). 
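
As a minimal sketch of the unobserved-recruit correction, the two quantities defined above (the number of unobserved recruits and their assumed diameter) can be computed as follows. The function names and the numbers in the usage note are hypothetical; the allometric biomass step is omitted because the equation is not given in this section.

```python
def unobserved_recruits(n_stems: float, recruitment_rate: float,
                        mortality_rate: float, interval_years: float) -> float:
    """Number of stems that both recruit and die within one census interval,
    following the product N x R x M x t given in the text."""
    return n_stems * recruitment_rate * mortality_rate * interval_years


def assumed_diameter_mm(median_growth_mm_per_yr: float, interval_years: float,
                        threshold_mm: float = 100.0) -> float:
    """Diameter assigned to an unobserved recruit: the 100 mm threshold plus
    growth for one-third of the interval at the median growth rate of the
    100-200 mm size class."""
    return threshold_mm + median_growth_mm_per_yr * interval_years / 3.0
```

For a hypothetical plot with 600 stems, annual recruitment and mortality rates of 0.02 and a 5-year interval, `unobserved_recruits(600, 0.02, 0.02, 5)` is about 1.2 stems.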

Mean time trends of biomass dynamics (black lines in Fig. 1 and Extended Data 
Figs 3 and 5-7) were calculated for each month since 1983 as the weighted mean 
across all sample units. As plots vary in total area monitored, we used an empirical 
weighting procedure to account for differences between plots in sampling effort by 
weighting according to the square root of plot area**’. Confidence intervals (95%) 
were estimated using weighted bootstrap sampling. 
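
The weighting scheme can be illustrated with a short sketch. The square-root-of-area weights follow the text; the bootstrap details are an assumption (plots are resampled with replacement and the weighted mean is recomputed each time), since the exact resampling procedure is not described here. Function names are hypothetical.

```python
import math
import random


def weighted_mean(values, areas_ha):
    """Mean across plots, weighting each plot by the square root of its area."""
    weights = [math.sqrt(a) for a in areas_ha]
    return sum(w * v for w, v in zip(weights, values)) / sum(weights)


def bootstrap_ci(values, areas_ha, n_boot=2000, seed=0):
    """Approximate 95% interval for the weighted mean.

    Assumption: plots are resampled with replacement; the source only
    states that a weighted bootstrap was used.
    """
    rng = random.Random(seed)
    indices = range(len(values))
    means = []
    for _ in range(n_boot):
        sample = rng.choices(indices, k=len(values))
        means.append(weighted_mean([values[i] for i in sample],
                                   [areas_ha[i] for i in sample]))
    means.sort()
    lower = means[int(0.025 * n_boot)]
    upper = means[int(0.975 * n_boot) - 1]
    return lower, upper
```

With this weighting a 4 ha plot counts twice as much as a 1 ha plot, not four times as much.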

To estimate long-term trends in biomass dynamics (cf. Figure 1), we first used 
general additive mixed models (GAMM) from the gamm4 R package”. Estimates 
of the long-term trends were obtained by regressing the rate of change (net change,
mortality or gains) against the mid-point of each census interval (Extended Data
Fig. 2). Here, systematic plot effects were explicitly accounted for by using plot as a
random effect in the model. This avoided switches over time in the exact set of plots 
being monitored influencing the long-term trends. As census interval length and 
plot sizes varied, we weighted each data point in the regression by the product of 
the census interval length (in years) times the square root of plot size (in hectares), 
as suggested previously*’. We estimated the linear slope of the long-term trend using 
the lme4 package’. In an identical way to the GAMM, we accounted for plot effects 
and added weights to the regression. To test whether the estimated time trends were 
robust to different plots being sampled over different timeframes, we also repeated 
the above analysis over shorter time windows (1990-2011.5, 1995-2011.5 and 
2000-2011.5) keeping the set of plots used completely constant. Results of this ana- 
lysis are shown in Extended Data Fig. 3. 
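
The regression weights described above can be sketched with a plain weighted least-squares slope. This is a simplification, not the mixed model used in the analysis: it omits the random plot effect, and the function name is hypothetical.

```python
import math


def weighted_slope(mid_years, rates, interval_years, plot_areas_ha):
    """Weighted least-squares slope of rate against census mid-point.

    Each data point is weighted by interval length (years) times the
    square root of plot area (ha), as in the text. No random effects.
    """
    weights = [t * math.sqrt(a) for t, a in zip(interval_years, plot_areas_ha)]
    w_sum = sum(weights)
    x_bar = sum(w * x for w, x in zip(weights, mid_years)) / w_sum
    y_bar = sum(w * y for w, y in zip(weights, rates)) / w_sum
    s_xy = sum(w * (x - x_bar) * (y - y_bar)
               for w, x, y in zip(weights, mid_years, rates))
    s_xx = sum(w * (x - x_bar) ** 2 for w, x in zip(weights, mid_years))
    return s_xy / s_xx
```

Long intervals on large plots dominate the fit, which is the intent of the weighting: they carry less sampling noise per data point.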

The approaches using GAMM and the linear slope calculations are parametric 
and assume normally distributed data, while census-level data on AGB mortality and 
net AGB change are non-normally distributed, showing respectively right-skewed 
and left-skewed distributions. Thus the observed time series for AGB mortality and
net AGB change do not strictly meet the criteria for this type of parametric analysis,
although it might be expected from the central limit theorem that with sufficiently
large data sets the regression analyses would still have validity. To test explicitly the
robustness of our estimates for the models of net change and mortality with regard
to violation of the normality assumption for ordinary least squares analysis, we used
a rank-based estimator for linear models available from the Rfit package. This shows
that slopes for AGB net change and mortality are similar or else of larger magnitude
using non-parametric tests (that is, slope net change = −0.057 Mg ha⁻¹ yr⁻²,
P < 0.001, slope mortality = 0.061 Mg ha⁻¹ yr⁻², P < 0.001, compared respectively
to values of −0.034 Mg ha⁻¹ yr⁻² and 0.051 Mg ha⁻¹ yr⁻² using the parametric
techniques). Non-parametric rank-based estimates of the slopes of the
change in standing biomass or mortality on a per-stem basis (Extended Data Fig. 6),
of changes in stem numbers and number of trees dying and recruiting per
hectare (Extended Data Fig. 7), and of basal area changes (Extended Data Fig. 8) show
similar results to those of the parametric tests: there is a significant decrease in net
change of standing biomass per stem (P = 0.0014), no trend in the losses on a per-stem
basis (P = 0.47), a significant decrease in the change in the number of stems
per hectare (P < 0.001), a marginally significant increase in number of recruits (P =
0.051), a significant increase in the number of trees dying (P < 0.001), a significant
decrease in net basal area change (P < 0.001), and a significant increase in basal
area mortality (P < 0.001).
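
To illustrate why a non-parametric slope is less sensitive to skewed data, the sketch below uses the Theil-Sen estimator (the median of all pairwise slopes). This is a swapped-in technique chosen for brevity; it is not the rank-based estimator implemented in Rfit, and the function name is hypothetical.

```python
from itertools import combinations
from statistics import median


def theil_sen_slope(x, y):
    """Median of pairwise slopes; robust to a minority of extreme values.

    Illustration only: a different estimator from the one in Rfit.
    """
    slopes = [
        (y2 - y1) / (x2 - x1)
        for (x1, y1), (x2, y2) in combinations(zip(x, y), 2)
        if x2 != x1
    ]
    return median(slopes)
```

A single extreme mortality event shifts an ordinary least-squares slope noticeably, but leaves the median of pairwise slopes almost unchanged.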

A second method for calculating the long-term trends in biomass dynamics in- 
volved estimating the slopes of the time trends for individual plots (Fig. 2). We did 
this only for those plots that had at least three census intervals, and more than 
10 years of total monitoring length with at least one census interval in the 1990s 
and in the 2000s. These stricter selection criteria were designed to allow us to focus
on a core set of data most likely to capture long-term patterns in regional biomass
dynamics. Slopes of biomass dynamics metrics were seldom statistically significant 
(P = 0.95) within plots, due to the stochastic nature of the dynamics data (Extended 
Data Fig. 2). We calculated the mean of the slopes across all plots weighted by the 
product of square root of plot area times the total census interval length. A t-test was 
used to test whether the mean values were significantly different from zero. 


All analyses were performed using the R statistical platform, version 3.0.2 (ref. 45).
No statistical methods were used to predetermine sample size. 
Extended Data Figure 3 | Time trends of subsets of net above-ground
biomass change, above-ground woody productivity and mortality rates for
plots that were continuously monitored throughout, for the periods
1990-2011, 1995-2011 and 2000-2011. Locations for the set of plots included
in the analysis for the different periods are shown in the maps in lower panels.
The red lines indicate the best model fit for the long-term trends using General
Additive Mixed Models (GAMM) accounting explicitly for differences in
dynamics between plots (red lines denote overall mean, broken lines denote
s.e.m.). Estimated long-term (linear) mean slopes (sl), P values and sample sizes
(n) are indicated (see Methods).
available stem-by-stem data. The red lines indicate the best model fit for the
long-term trends using General Additive Mixed Models (GAMM) accounting
explicitly for differences in dynamics between plots (red lines denote overall
mean, broken lines denote s.e.m.). Estimated long-term (linear) mean slopes
and significance levels are indicated (see Methods).


©2015 Macmillan Publishers Limited. All rights reserved 


[Figure: panels a-c plotted against year (1985-2010). a) Net change in number of stems
per ha (number of plots = 234; slope = −0.077 stems ha⁻¹ yr⁻²; P = 0.016). b) Number of
recruits per ha (slope = 0.101 stems ha⁻¹ yr⁻²; P = 0.001). c) Number dying per ha
(slope = 0.152 stems ha⁻¹ yr⁻²; P < 0.001).]

Extended Data Figure 6 | Rates of change in number of stems plus
annualized fluxes of stems bigger than 10 cm in diameter. a-c, Mean net
change in number of stems (a), number of recruits (b), and number of
dying trees (c). Analyses are based on 234 plots, excluding published studies
without available stem-by-stem data. The red lines indicate the best model fit
for the long-term trends using General Additive Mixed Models (GAMM)
accounting explicitly for differences in dynamics between plots (red lines
denote overall mean, broken lines denote s.e.m.). Estimated long-term (linear)
mean slopes and significance levels are indicated (see Methods).


[Figure: panels a-c plotted against year (1985-2010). a) Net basal area change
(number of plots = 234; slope = −0.004 m² ha⁻¹ yr⁻²; P = 0.002). b) Basal area
productivity (slope = 0.004 m² ha⁻¹ yr⁻²; P < 0.001). c) Basal area losses
(slope = 0.007 m² ha⁻¹ yr⁻²; P < 0.001).]

Extended Data Figure 7 | Basal area change, productivity and mortality. […]
the long-term trends using General Additive Mixed Models (GAMM) […]


Polyploidy can drive rapid adaptation in yeast


Anna M. Selmecki, Yosef E. Maruvka, Phillip A. Richmond, Marie Guillet, Noam Shoresh, Amber L. Sorenson,
Subhajyoti De, Roy Kishony, Franziska Michor, Robin Dowell & David Pellman


Polyploidy is observed across the tree of life, yet its influence on 
evolution remains incompletely understood’ *. Polyploidy, usually 
whole-genome duplication, is proposed to alter the rate of evolu- 
tionary adaptation. This could occur through complex effects on the 
frequency or fitness of beneficial mutations”*’. For example, in 
diverse cell types and organisms, immediately after a whole-genome 
duplication, newly formed polyploids missegregate chromosomes 
and undergo genetic instability* *. The instability following whole- 
genome duplications is thought to provide adaptive mutations in 
microorganisms and can promote tumorigenesis in mammalian
cells. Polyploidy may also affect adaptation independently of beneficial
mutations through ploidy-specific changes in cell physiology.
Here we perform in vitro evolution experiments to test directly whether 
polyploidy can accelerate evolutionary adaptation. Compared with 
haploids and diploids, tetraploids undergo significantly faster adap- 
tation. Mathematical modelling suggests that rapid adaptation of 
tetraploids is driven by higher rates of beneficial mutations with
stronger fitness effects, which is supported by whole-genome sequencing
and phenotypic analyses of evolved clones. Chromosome aneuploidy,
concerted chromosome loss, and point mutations all provide
large fitness gains. We identify several mutations whose beneficial 
effects are manifest specifically in the tetraploid strains. Together, 
these results provide direct quantitative evidence that in some envi- 
ronments polyploidy can accelerate evolutionary adaptation. 

To determine how polyploidy affects the rate of adaptation, we per- 
formed hundreds of independent passaging experiments in a poor carbon- 
source medium (raffinose, Fig. 1a), comparing isogenic haploid (1N), 
diploid (2N), and tetraploid (4N) strains (Extended Data Fig. 1 and Ex- 
tended Data Table 1). The evolution experiments were performed as 
competitions between equal numbers of cells of the same ploidy express- 
ing cyan fluorescent protein (CFP) or yellow fluorescent protein (YFP)’”"’, 
where the acquisition and spread of beneficial mutations is visualized by 
divergence from a 50:50 ratio of CFP- and YFP-expressing cells (Fig. 1b). 
The rate of adaptation was determined by measuring the change in
fitness relative to the diploid ancestor over time (Methods). Over 250
generations, the tetraploids adapted at a rate that was significantly faster
than haploids or diploids (Fig. 1c; t-test, P < 1 × 10⁻¹⁰, Methods). This
faster rate of adaptation in tetraploids may be due to a higher rate of
beneficial mutations, higher fitness effects of the acquired mutations,
or both.

Figure 1 | Rapid spread of beneficial mutations in tetraploid yeast.
a, Schematic diagram of the evolution experiment. b, Flow cytometry analysis
[…] 4N is red. c, The adaptation rate of the evolved […] analysed in
Extended Data Fig. 2c, d, and the equivalence principle model is analysed in
Extended Data Fig. 2e). Error ranges were obtained by parametric bootstrap of
1,000 independent realizations (Methods).

To gain insight into the rapid adaptation of tetraploids, we applied 
two complementary mathematical modelling approaches (see Methods). 
First, we use a model based on a branching evolutionary process”, 
designed to mimic closely the divergence experiments. At each time- 
step, a cell is chosen at random to die or to divide, with a probability 
corresponding to its fitness. Mutations arise with rate μ. If a mutation
occurs, the fitness of the daughter cell may change and the fitness increase 
is then chosen from a fitness distribution. Second, we use the ‘equivalence 
principle’ model’”, which focuses on beneficial mutations that establish 
in the population, and estimates that these mutations confer a single 
effective fitness advantage. Proliferation of clonal subpopulations under 
this model is deterministic. These simplifications are relevant to exam- 
ples of high clonal interference, and are therefore only appropriate when 
the population size is large or when the beneficial mutation rate is high”. 
In both models, we assume no epistasis; the fitness change is indepen- 
dent of whether the cell already had one or more mutations. Further- 
more, there is no restriction on the number of cells that acquire beneficial 
mutations, thus allowing clonal interference to occur'*”’. 
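
The branching-process idea can be illustrated with a toy simulation of one CFP/YFP competition. This is a sketch under stated assumptions, not the authors' model: it uses a Moran-style update (one fitness-weighted division and one random death per step, constant population size), draws fitness gains from an exponential distribution, and all parameter values and names are hypothetical.

```python
import random


def simulate_competition(pop_size=200, mu=1e-3, mean_effect=0.05,
                         steps=5000, seed=1):
    """Return the final fraction of CFP-labelled cells.

    Assumptions (not from the source): Moran-style updates, exponential
    distribution of beneficial effects, additive fitness (no epistasis).
    """
    rng = random.Random(seed)
    # Start from a 50:50 mix of two labels, all with ancestral fitness 1.0.
    labels = ["CFP"] * (pop_size // 2) + ["YFP"] * (pop_size - pop_size // 2)
    fitness = [1.0] * pop_size
    for _ in range(steps):
        # Choose a cell to divide with probability proportional to fitness.
        parent = rng.choices(range(pop_size), weights=fitness, k=1)[0]
        child_fitness = fitness[parent]
        if rng.random() < mu:
            # A beneficial mutation adds a randomly drawn fitness increase.
            child_fitness += rng.expovariate(1.0 / mean_effect)
        # A randomly chosen cell dies and is replaced by the daughter.
        dead = rng.randrange(pop_size)
        labels[dead] = labels[parent]
        fitness[dead] = child_fitness
    return labels.count("CFP") / pop_size
```

Running many replicates with different seeds and recording how quickly the CFP fraction leaves 0.5 mimics the divergence readout; raising `mu` or `mean_effect` makes divergence faster, which is the qualitative behaviour the fitted models exploit.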

Both modelling approaches led to the same general conclusion: the 
rapid adaptation of tetraploids results from both more frequent bene- 
ficial mutations and stronger fitness effects (Extended Data Fig. 2 and 
Methods). For the branching evolution model, these conclusions are 
independent of the assumed distribution of beneficial mutations, although 
there are differences in the magnitude of the best-fit values that are 
expected from the shape of the chosen distribution (Fig. 1d and Extended 
Data Fig. 2). Moreover, the conclusions are insensitive to the inclusion of 
deleterious mutations in the model (Extended Data Fig. 3 and Methods). 

To evaluate these conclusions experimentally, we performed whole- 
genome sequencing (WGS) to compare the frequency of mutations in 
the 1N and 4N ancestors with 74 evolved clones. In total, we identified 
240 de novo sequence variants (single nucleotide polymorphisms (SNPs) 
and small insertions/deletions): 45 from the 1N-, 69 from the 2N-, and 
126 from the 4N-evolved clones, an average of 2.05, 2.87, and 4.5 variants 
respectively per cell type (Supplementary Table 1). We observed signi- 
ficantly more variants per 4N clone than per 1N- and 2N-evolved clones 
(Fig. 2a; t-test, P < 1 × 10⁻⁴ and P = 0.0040, respectively). Note that
these results are not a direct measurement of the mutation rate or bene- 
ficial mutation rate (μ), but rather the total number of mutations acquired
during the experiment (see Supplementary Discussion). 

Sequence variants frequently occurred in genes encoding proteins in 
the Snf3/Rgt2 glucose-signalling pathway (SNF3, RGT2, MTH1, RGT1), 
as expected from previous yeast evolution experiments under carbon- 
source limitation” **. Several independent mutations in these genes 
resulted in either identical base-pair changes or altered the same amino 
acid (Supplementary Table 1). Non-synonymous SNF3 mutations were 
identified in all ploidy types, whereas loss-of-function mutations in 
MTH1 were observed most frequently in the 1N-evolved clones.

In addition to WGS, we used a combination of flow cytometry, micro- 
array comparative genome hybridization (aCGH), and quantitative PCR 
(qPCR) to measure the frequency of DNA copy number variations (CNVs) 
in the evolved clones. The only CNV that arose in all three ploidy
types was amplification of two adjacent genes encoding the high-affinity
hexose transporters, HXT6 and HXT7, a frequently identified beneficial
mutation in low glucose environments. The HXT6/7 amplification
was significantly more common in 2N- and 4N-evolved clones than
in 1N clones (t-test, P = 0.005 and P = 1 × 10⁻⁴, respectively, Methods),
which may be due to negative epistasis between HXT6/7 amplification
and mutations in 1N cells, such as those in MTH1 (ref. 24).

Figure 2 | Tetraploid clones acquire frequent sequence variants, large-scale
ploidy shifts and recurrent whole chromosome aneuploidy during
adaptation. a, The number of sequence variants per clone was determined with
WGS of 74 evolved clones (22 haploid, 24 diploid, and 28 tetraploid clones;
Supplementary Table 1). The difference between tetraploids and haploids or
diploids was significant (t-test, P < 1 × 10⁻⁴ and P = 0.004, respectively).
b, DNA content (1C-4C) of evolved clones at generation 250, measured as
the mean G1 propidium iodide fluorescence for each evolved clone (n = 88
haploid and diploid evolved clones, n = 176 tetraploid evolved clones) in
arbitrary units (a.u.). For reference, the DNA content of ancestral, control
strains (1N, 2N, 3N, and 4N) is shown in grey. c, Heat map of chromosome
copy number data obtained from aCGH and WGS for the ancestral and evolved
1N, 2N, and 4N clones at generation 250; colour key at bottom. See Extended
Data Figs 4-6 and Supplementary Table 2 for all individual clones.

Additional CNVs, including recurrent chromosome aneuploidy, were
detected only in the 4N-evolved clones. With the exception of a small
segmental amplification in one 2N-evolved clone, there were no CNVs 
or aneuploidy in the ancestral strains or the 1N- and 2N-evolved clones 
(Fig. 2b, c and Extended Data Figs 4 and 5). By contrast, many 4N clones
underwent large shifts in ploidy (Fig. 2b) and all but two of the 4N-evolved 
clones were aneuploid at generation 250 (n = 30; Fig. 2c, Extended Data 
Fig. 6 and Supplementary Table 2). These alterations included large seg- 
mental aneuploidies with breakpoints at loci of transposable elements 
(Extended Data Fig. 7a). Pairwise patterns of chromosome copy num- 
ber alterations were observed, indicating that there is a strong copy 
number relationship between certain pairs of chromosomes (Extended 
Data Fig. 7b, c). Notably, increased copy number of chromosome XIII 
was significantly more common than all other aneuploidies (Extended 
Data Fig. 7d; Cochran-Armitage test, P< 1 X 10°’). These chromosome- 
level alterations were present early, at the time of CFP/YFP marker diver- 
gence in the 4N populations (~generation 45; Extended Data Fig. 8). 
Therefore, 4N-evolved clones had a higher frequency and greater diver- 
sity of mutations, supporting the inference from our mathematical 
model that 4N-evolved clones have a relatively higher beneficial mutation
rate.

Next, we determined the effects of specific mutations on the fitness 
of the ancestral cells of differing ploidy. We first determined whether 
chromosome XIII gain contributed directly to the rapid adaptation of 
4N cells. Isogenic 2N and 4N strains, with and without an extra copy of 
chromosome XIII, were generated (Methods and Extended Data Fig. 9). 
The increased copy number of chromosome XIII provided a significant 
fitness increase to 4N strains specifically in raffinose medium relative to 
the 2N ancestor (Fig. 3a; t-test, P < 1 × 10⁻⁴), but not in glucose (Fig. 3b).
This was not a general effect of aneuploidy because the gain of a differ- 
ent chromosome, chromosome XII, had the opposite effect on fitness 
(Fig. 3a). In striking contrast to 4N cells, chromosome XIII trisomy was 
not beneficial to 2N strains in raffinose medium and decreased fitness 
of 2N cells in glucose. Although increased fitness due to whole and seg- 
mental chromosome gain is known to occur during adaptation'*"*”, to 
our knowledge this is the first observation of a ploidy-specific fitness 
advantage for an aneuploid chromosome. Thus, aneuploidy, acquired 
through high rates of mitotic errors, is one way that 4N cells can acquire 
more beneficial mutations with higher fitness effects. 

We also characterized how ploidy impacts the fitness effect of recurrently
isolated mutations in SNF3, a gene encoding a plasma-membrane
glucose sensor. We identified SNPs that changed the codon for the
same amino acid in the ninth transmembrane domain of Snf3p (G439E,
G439V, G439R; Supplementary Table 1), and increased HXT expression
in raffinose (Methods). By analysing the fitness of isogenic SNF3-G439E
strains differing only by ploidy, we found that SNF3-G439E had
a dominant, raffinose-specific, beneficial effect that was relatively stronger
in the 4N strain (Fig. 3c, d; t-test, P < 1 × 10⁻⁴).

Figure 3 | Ploidy-specific fitness effects for certain beneficial mutations.
Gain of chromosome (chr.) XIII is beneficial to tetraploid cells grown in
raffinose medium but not for diploids. Shown is the fitness of isogenic wild-type
2N and 4N strains, with or without chromosome XIII gain, relative to the 2N
ancestor in raffinose (a) or glucose (b) medium. Error bars, mean ± s.e.m.
of four individual clones and two or three technical replicates. c, Competitive
fitness of engineered isogenic strains of the indicated ploidy and genotype,
relative to the corresponding ploidy ancestor, in raffinose and d, glucose
medium. Error bars, mean ± s.e.m. of three independent SNF3-G439E
transformants of each ploidy type, t-test ***P < 1 × 10⁻⁴.

Ploidy-specific effect size of mutations could be an intrinsic property
of polyploidy, as was recently suggested in plants, or it could be related
to the fitness of the 4N ancestor relative to the 1N and 2N ancestors.
To address the impact of initial fitness generally, we isolated 48 clones 
from the 4N evolution experiments at generation 250 (4N250) with 
fitness values equal to the 2N ancestor (competitive fitness difference 
<0.05), and determined the speed of their next adaptive step. We com- 
pared the fitness acquired by the selected 4N250 clones after an additional 
250 generations with that of 2N clones evolved for 250 generations 
(2N250, n = 160). Despite comparable starting fitness, the 4N-derived 
clones still underwent more rapid adaptation and achieved significantly 
higher fitness. This occurred irrespective of large-scale shifts in ploidy: 
29% of the 4N500 clones maintained a ploidy of 3N-4N and acquired
higher fitness than the 2N250 clones (Fig. 4, Kolmogorov-Smirnov test,
P<1X 10 °); 71% of the 4N500 clones underwent chromosome loss to
become near-diploid and acquired even higher fitness relative to 2N250
clones (Fig. 4, Kolmogorov-Smirnov test, P < 1 × 10⁻⁴). Thus, the rapid
adaptation of tetraploid cells was at least partly independent of their 
initial fitness. 

Here, we measured the acquisition and spread of beneficial muta- 
tions in isogenic yeast populations that differed only by ploidy. Mathe- 
matical modelling enabled us to infer parameters driving the evolutionary 
dynamics of these strains and indicated that in a poor carbon-source 
environment, polyploidy increases the rate and fitness effects of the 
acquired mutations. Polyploidy increased the genetic diversity of the 
population. We identified examples of mutations that are selectively bene- 
ficial in polyploid strains, including whole chromosome aneuploidy. 
Because aneuploidy itself is mutagenic’*, the high rates of aneuploidy 
induced by whole-genome duplication may further increase the rate at 
which beneficial mutations are acquired. If these mutations are bene- 
ficial at lower ploidy states, then the long-term benefit of polyploidy will 
be preserved, even if polyploidy is transient during adaptation. Indeed, 
4N-evolved clones that became near-diploid had higher fitness than 
the 2N-evolved clones. Moreover, although we only studied one envi- 
ronmental condition, polyploidy buffers the effects of partly recessive 
deleterious mutations'*”, which in principle can then accumulate’, pro- 
viding a reservoir of mutations that might be adaptive in a new environ- 
ment. Interestingly, the evolved tetraploid karyotypes closely resemble 
the polyploid and aneuploid karyotypes of fermentation, industrial, 
baking, natural desert isolates”, and antifungal drug-resistant yeasts“, 
consistent with a role for polyploidization events during adaptation to
these stressful environments. Thus, the genetic plasticity of polyploid
cells, together with ploidy-specific beneficial effects, can facilitate rapid
adaptation.

Figure 4 | Rapid adaptation of tetraploids normalized for initial fitness.
Fitness of 2N and 4N clones relative to the 2N ancestor. Evolved tetraploids
(4N250) with fitness equivalent to the diploid ancestors were identified and
passaged for another 250 generations to generate 4N500 clones (n = 48). The
fitness of these 4N500 clones was then compared with the fitness of evolved
diploids after 250 generations (2N250, n = 160, replicate experiments A and B).
4N500 clones reached a higher fitness than 2N250 clones, irrespective of whether
the 4N500 clones maintained a 3N-4N DNA content (n = 14, Kolmogorov-
Smirnov test, **P <1 X 10°) or underwent large-scale chromosome loss to a
near-diploid chromosome content (n = 34, Kolmogorov-Smirnov test,
***P < 1 × 10⁻⁴). Error bars, mean ± s.e.m.


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
Batch culture evolution experiment. All Saccharomyces cerevisiae strains used in
this study were in the S288c background (detailed information on strain construction
is provided in the section entitled ‘Yeast strain construction’). Briefly, the isogenic
ploidy series was generated in a matΔ ste4Δ background to eliminate mating
and meiosis during the course of the experiment. Either a pGAL-CFP or a pGAL- 
YFP construct was integrated at the TRP1 locus near the chromosome IV centro- 
mere in a haploid strain (PY5998 and PY5999, respectively). These haploid strains 
were used to generate isogenic diploids, from which isogenic tetraploids were then 
derived (Extended Data Fig. 1). This procedure ensured that all copies of chromo- 
some IV had the capacity to express the inducible fluorescent marker even if the 
strains became aneuploid. Mating-competent haploids were generated from the 
matΔ ste4Δ ancestor, PY5998, by transformation with either plasmid PB2647 (CEN-
LEU2-STE4) or PB2648 (CEN-URA3-STE4-MATα). Zygotes from mating-competent
haploids were isolated by micromanipulation to obtain diploid CFP ancestors (PY6008 
and PY6022). Similarly, zygotes from mating-competent diploids were isolated 
by micro-manipulation to obtain tetraploid CFP ancestors (PY6031 and PY6032). 
The same mating scheme was performed for the YFP lineage starting with PY5999 
to generate diploid YFP (PY6006 and PY6014) and tetraploid YFP (PY6040 and 
PY6045) ancestors. 

The ancestor strains were grown to saturation from the —80 °C stock, in Syn- 
thetic Complete + 2% glucose. The cell density of each ancestor was determined 
using a haemocytometer and an automated cell counter (Vi-Cell-XR from Beckman 
Coulter). An equal number of YFP and CFP cells of the same ploidy were diluted 
into fresh Synthetic Complete + 2% raffinose medium, and combined into a single 
tube for an initial concentration of 1 X 10° cells per millilitre. The 50:50 YFP:CFP
culture was distributed equally into the wells of a 96-deep-well plate (1 ml per well, 
U-bottom block plate from Qiagen). Seven or eight wells were not inoculated, to 
detect cross-well contamination during the experiment. The plates were covered 
with ‘breathe-EASIER’ tape (Electron Microscopy Science) and incubated at 30 °C 
on a 96-well plate shaker (Union Scientific). Two plates of haploid and three plates
of diploid and tetraploid cells were analysed, representing 173 parallel haploid evo- 
lutions, 264 parallel diploid evolutions, and 265 parallel tetraploid evolutions. 

At 24h intervals, the cells were resuspended (by pipetting) and diluted into fresh 
Synthetic Complete + 2% raffinose medium. The dilution factor was determined 
for each ploidy type based on the initial strain fitness to maintain an equivalent pop- 
ulation size, as reported previously”. The number of cells transferred each day was 
calculated by counting the number of cells in ten replicate wells of each ploidy before 
and after dilution with an automated cell counter (Vi-Cell-XR from Beckman Coulter), 
and averaged across 3 consecutive days. The dilution factor for the haploid, diploid, 
and tetraploid experiments was 1/100, 1/50, and 1/33, respectively. This corresponds 
to 6.64, 5.64, and 5.04 generations per day'’. The tetraploid evolution experiment 
from generation 250 to 500 (Fig. 4) was performed with the same dilution factor as 
the diploid experiments (1/50). 
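
The generations-per-day figures follow from the dilution factors if each population regrows to its pre-dilution density every day, so that a D-fold dilution is made up by log2(D) doublings. A short check of that arithmetic (the function name is hypothetical, and the regrowth assumption is stated here rather than in the text):

```python
import math


def generations_per_day(fold_dilution: float) -> float:
    """Doublings needed to recover from a daily fold_dilution-fold dilution,
    assuming the culture returns to the same density before each transfer."""
    return math.log2(fold_dilution)


for ploidy, fold in (("1N", 100), ("2N", 50), ("4N", 33)):
    print(ploidy, round(generations_per_day(fold), 2))
```

The three values reproduce the 6.64, 5.64 and 5.04 generations per day quoted above for the 1/100, 1/50 and 1/33 dilutions.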

The number of CFP and YFP cells in each population was measured at the same 
time each day. First, expression of the fluorescent proteins was induced by trans- 
ferring 10 µl of the overnight culture into 200 µl Synthetic Complete + 2% galactose
medium for 4h at 30 °C. The number of CFP- or YFP-expressing cells was deter- 
mined using a BD LSRII flow cytometer high-throughput plate reader (10,000 cells 
were analysed from each well). Pacific Blue and FITC filters were used to detect CFP 
and YFP, respectively. All experiments were passaged for 250 generations, but daily 
acquisition of CFP:YFP ratios was not always continued to the 250th generation. 

To ensure that the flow cytometer measurement and the galactose induction of 
CFP and YFP was an accurate reflection of the size of these populations, the ratio of 
CFP:YFP cells was determined by both flow cytometry and microscopy, and the 
ratio was determined both before and after galactose induction. To do this, we com- 
bined overnight cultures of the 1N, 2N, and 4N ancestor CFP and YFP strains at 
three different ratios (nine populations in total) and analysed the ratios in two ways. 
First, for an aliquot of the mixture, we induced the expression of the fluorescent 
proteins with 2% galactose for 4h and analysed 10,000 cells using flow cytometry. 
In parallel, we also added 2% galactose for 4h and then counted about 300 cells by 
fluorescence microscopy. Finally, to ensure that the induction with 2% galactose 
did not alter the CFP:YFP ratio, a portion of the population was used to determine
the number of CFP and YFP cells in the population before adding galactose to the 
medium. To do this, cells from each population were struck for single colonies on 
YPD plates for 2 days. Ninety-six colonies were chosen randomly from each plate 
and added to a single well of a 96-well plate containing Synthetic Complete + 2% 
galactose. The fluorescence of each colony was determined by flow cytometry, and 
the percentage YFP of the initial population was determined. There was a strong
correlation between the percentage of YFP-expressing cells obtained from all three
measurements (Extended Data Fig. 3a), both between flow cytometry and fluorescence
microscopy (Pearson correlation coefficient = 0.979) and between measurements taken
before and after galactose induction (Pearson correlation coefficient = 0.985).
Finally, frozen stocks of the evolution experiments were made at 3- to 4-day
intervals throughout the experiment. At the end of each experiment, single colony 
clones were isolated and used for competitive fitness assays, flow cytometry anal- 
ysis of ploidy, and preparation of DNA for aCGH. 

We isolated 48 clones from the 4N evolution experiments at generation 250 (4N₂₅₀)
with fitness values equal to the 2N ancestor (competitive fitness difference < 0.05),
and determined the rate of adaptation after an additional 250 generations (Fig. 4). 
Each 4N₂₅₀ clone was grown to saturation from the −80 °C stock in Synthetic Complete
+ 2% raffinose medium. Cell counts were performed as above, and each population
was diluted to an initial concentration of 1 X 10° cells per millilitre. At 24 h
intervals, the cells were resuspended (by pipetting) and diluted 1/50 into fresh 
Synthetic Complete + 2% raffinose medium (the same dilution factor as the diploid 
experiments). These evolution experiments were not performed as CFP:YFP com- 
petitions, so daily flow cytometry was not necessary. After 250 generations, single 
colony clones (4N₅₀₀) were isolated on Synthetic Complete + 2% raffinose plates.
Each 4N₅₀₀ clone was cultured overnight in 1 ml Synthetic Complete + 2%
raffinose medium and aliquots of this culture were immediately used for compet- 
itive fitness assays, flow cytometry analysis of ploidy, preparation of DNA for 
aCGH, and frozen stocks. 
Measuring the variation in the flow cytometer measurements. We determined 
the amount of noise in our flow cytometer measurements by calculating the mean 
and standard deviation of the percentage YFP obtained from 48 independent populations
at six different ratios of CFP:YFP, for each ploidy type. The 1N, 2N, and
4N ancestor strains were cultured separately overnight in 2% raffinose medium 
and transferred to 2% galactose for 4 h to induce expression of CFP and YFP. Next, 
CFP and YFP cells of the same ploidy were combined at ratios of 100:0, 85:15, 75:25, 
50:50, 25:75, and 0:100 to reach the same final volume (200 µl). Ten thousand cells
from each population were analysed by the LSRII (BD) flow cytometer using the 
same parameters (for example gating and flow rate) that we used for the evolution 
experiments. The total numbers of CFP and YFP cells were obtained, and the percentage
YFP was calculated as ((number of YFP cells)/(number of CFP cells + number of
YFP cells)) × 100. The standard deviations are presented in Extended Data Fig. 3b,
and indicate that there was little well-to-well variability for the same CFP:YFP 
ratio (across 48 wells), and that this variability changed only slightly across differ- 
ent CFP:YFP ratios of all three ploidy types. Importantly, it was never greater than 
0.66% of the measurement. This small variability had a minimal effect on our analysis 
because the fitting procedure we used to measure the deviation from an equal per- 
centage of CFP and YFP cells used bins of 5% deviation to combine the number of 
wells that had deviations between 0 and 5%, 5 and 10%, and so on, for each experiment. 
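
The percentage calculation and the well-to-well variability check described above can be sketched as follows. This is a minimal illustration, not the authors' analysis code: the function names and the example cell counts are hypothetical, and the sample standard deviation is assumed as the measure of spread.

```python
from statistics import mean, stdev


def percent_yfp(n_cfp, n_yfp):
    """Percentage of YFP cells among all fluorescent cells counted in one well."""
    return 100.0 * n_yfp / (n_cfp + n_yfp)


def well_variation(wells):
    """Mean and sample standard deviation of %YFP across replicate wells."""
    values = [percent_yfp(cfp, yfp) for cfp, yfp in wells]
    return mean(values), stdev(values)


# Hypothetical (CFP, YFP) counts for four replicate wells mixed at about 50:50,
# each with 10,000 cells analysed.
example_wells = [(5010, 4990), (4985, 5015), (5002, 4998), (4995, 5005)]
avg, sd = well_variation(example_wells)
print(f"mean %YFP = {avg:.2f}, standard deviation = {sd:.2f}")
```

A standard deviation that is small relative to the 5% bin width used in the fitting procedure would have little influence on which bin a well falls into.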
Statistical analysis of the experimental data. No statistical methods were used to 
predetermine sample size. 

Adaptation rate. The CFP versus YFP evolution experiments were designed to 
analyse the dynamics of the adaptation process at the population level within and 
across wells. The adaptation rate was determined with additional competition exper- 
iments that were designed to measure the change in fitness from generation zero to 
generation 250 for cells of each ploidy. To this end, we isolated single colony clones 
from each evolved well of different ploidy types at generation 250 and measured 
their competitive fitness relative to the 2N ancestor (see detailed methods below). 
Fitness was defined as the slope of the log of the ratio of the evolved clone to the
reference strain (2N ancestor) over time; more precisely, we performed a linear
least-squares fit of $\log(N_{\mathrm{evolved}}/N_{\mathrm{ancestor}})$ over multiple dilution cycles (where $N_{\mathrm{evolved}}$ is the number
of cells from the evolved clone and $N_{\mathrm{ancestor}}$ is the number of ancestor cells). The fitness
relative to the ancestor is defined as $s = \frac{d}{dt}\left[\log_2(N_{\mathrm{evolved}}/N_{\mathrm{ancestor}})\right]$, where t is measured
in days. The rate of adaptation was the relative fitness at generation 250 minus
the relative fitness of generation zero, divided by 250 generations. The rate of adap- 
tation for each ploidy is shown in Fig. 1c. We found that the tetraploid populations 
had a significantly larger rate of adaptation (0.009 [0.0062, 0.011]) than haploids 
(0.0031 [0.0022, 0.0052]) or diploids (0.0031 [0.0018, 0.0041]) during the 250 gen- 
erations in raffinose medium (values indicate the median rate followed by the 95% 
confidence intervals in square brackets, Fig. 1c, t-test, P< 1X 107 ?°), 
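
As a rough illustration of the fitness and adaptation-rate definitions above, the sketch below fits a least-squares slope to the log ratio of evolved to ancestor cells and converts two fitness values into a rate per generation. The function names and cell counts are hypothetical, and a base-2 logarithm is assumed.

```python
import math


def fitness_slope(days, n_evolved, n_ancestor):
    """Least-squares slope of log2(evolved/ancestor) against time in days."""
    y = [math.log2(e / a) for e, a in zip(n_evolved, n_ancestor)]
    t_mean = sum(days) / len(days)
    y_mean = sum(y) / len(y)
    numerator = sum((t - t_mean) * (v - y_mean) for t, v in zip(days, y))
    denominator = sum((t - t_mean) ** 2 for t in days)
    return numerator / denominator


def adaptation_rate(fitness_gen250, fitness_gen0, generations=250):
    """Change in relative fitness per generation over the experiment."""
    return (fitness_gen250 - fitness_gen0) / generations


# Hypothetical counts: the evolved clone doubles relative to the ancestor each day.
days = [0, 1, 2, 3]
evolved = [1000, 2000, 4000, 8000]
ancestor = [1000, 1000, 1000, 1000]
s = fitness_slope(days, evolved, ancestor)
print(s, adaptation_rate(s, 0.0))
```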
Mathematical modelling of population dynamics. Branching evolution model. 
We formulated a mathematical model of the population dynamics of cells that was 
then used to infer evolutionary parameters using the experimental data. This branch- 
ing evolution model was designed to mimic closely the divergence experiments 
containing two equally fit populations (CFP or YFP), each initially consisting of 
50,000 cells. The model is based on a stochastic birth and death process called a 
branching process. In this process, at each time-step, a cell is chosen to die at
random, or to divide with a probability corresponding to its fitness. During each 
cell division, a mutation arises with mutation rate μ. If no mutation occurs, the fitness
of the daughter cell is equal to the fitness of the mother cell. If a mutation does
occur, the fitness may change; the additive fitness of the daughter cell is then chosen 
from a fitness distribution. The fitness change is independent of whether the ances- 
tors of this clone had already obtained one or more mutations. Furthermore, there 
is no restriction on the number of cells that acquire beneficial mutations, thus
allowing clonal interference to occur.

We compared results assuming either a uniform, exponential, or delta distribution
of fitness values (Extended Data Fig. 2a-d). For the initial formulation of the
model, we considered half of the newly arising mutations to be beneficial and half 
deleterious, and their fitness effects were considered to be additive to the fitness value 
of the mother cell. Because complete simulations of the branching process would be 
prohibitively slow, we approximated this branching process with a Wright-Fisher 
process with non-overlapping generations. This process was implemented as a
Monte Carlo simulation in C++ and the code is provided as a Supplementary
Software file. 

In the simulation, each competition experiment was initiated with 1 × 10⁵ cells
(5 × 10⁴ of each CFP and YFP cell type). At every generation, each cell reproduces
and gives birth to a random number of surviving offspring distributed according to 
a Poisson distribution. The initial population of each ploidy had a different growth 
rate per day: the haploid population increased approximately 100-fold, the diploids 
about 50-fold, and the tetraploids about 30-fold in a 24h time interval (described 
previously in the section entitled ‘Batch culture evolution experiment’). Assuming 
a population doubling every generation, the average number of generations per day 
was 6.64, 5.65, and 5.04 for haploids, diploids, and tetraploids, respectively. In the 
Wright-Fisher model, the number of generations is discrete; thus we rounded these 
numbers to the closest integers (7, 6, and 5, respectively). The initial fitness (f) of 
each ploidy was chosen to satisfy the growth rate of that ploidy with this number of 
generations: that is, $(f_{1N})^{7} = 100$, $(f_{2N})^{6} = 50$, $(f_{4N})^{5} = 30$. The fitness of a cell is the
average number of its surviving offspring in the next generation. 
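
The relationship between daily growth, generations per day and initial fitness can be made concrete with a short sketch. It is not the authors' C++ implementation: the function names are hypothetical, and the Poisson sampler uses Knuth's method, chosen here only because it needs no external library.

```python
import math
import random

# Daily fold increase and rounded generations per day, as stated in the text.
DAILY_FOLD = {"1N": 100, "2N": 50, "4N": 30}
GENERATIONS_PER_DAY = {"1N": 7, "2N": 6, "4N": 5}


def initial_fitness(ploidy):
    """Per-generation fitness f satisfying f ** g == daily fold increase."""
    return DAILY_FOLD[ploidy] ** (1.0 / GENERATIONS_PER_DAY[ploidy])


def poisson(lam, rng):
    """Draw one Poisson variate (Knuth's method; adequate for small lam)."""
    threshold = math.exp(-lam)
    k, p = 0, rng.random()
    while p > threshold:
        k += 1
        p *= rng.random()
    return k


def next_generation(n_cells, fitness, rng):
    """Each cell leaves a Poisson(fitness) number of surviving offspring."""
    return sum(poisson(fitness, rng) for _ in range(n_cells))


rng = random.Random(0)
f_2n = initial_fitness("2N")
offspring = next_generation(1000, f_2n, rng)
print(f"2N initial fitness = {f_2n:.4f}, offspring of 1,000 cells = {offspring}")
```

Summing one Poisson draw per cell is slow for populations of the size used in the experiments; it is shown here only to make the per-cell reproduction rule explicit.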

During each cell division, a new mutation might arise. In haploid cells, the impact 
of a mutation is given by its additive fitness value s drawn from the fitness distri- 
bution. In diploid and tetraploid cells, however, a mutation might have a degree of 
dominance, be recessive or have a different fitness effect than it would have in a 
haploid cell. Following ref. 2, we assumed that the effect of a given mutation in a 
diploid or tetraploid cell compared with that in a haploid cell is scaled by a certain 
factor h. Therefore, the fitness effect of a given mutation in one allele will be $\tilde{s} = hs$.

In our model, we considered a beneficial mutation to arise independently at rate 
μ (per whole genome). Because our populations proliferated asexually, we could
not differentiate between the two components of the scaled effect $\tilde{s} = hs$, namely the haploid fitness
effect s and the dominance coefficient h, and therefore we could only infer the combined
value $\tilde{s}$. Our simulations were done in exactly the same way for all ploidy
types, and the selection coefficient of a new mutation was taken from a given distribution.
However, the meaning of the added fitness is different for the haploids,
where it is s, than for the diploids and tetraploids, where it is $\tilde{s}$. The assumption of
independence of mutations is justified by the low point mutation rate, which is of 
the order of magnitude of 2 × 10⁻¹⁰ per base per generation in yeast. The low
per-base mutation rate means that the probability of independently obtaining a
second identical mutation is vanishingly small (probability of μ², or 4 × 10⁻²⁰)
and therefore we did not consider such events in the model. Similarly, the prob- 
ability of obtaining a given mutation that is then copied by a recombination-based 
mechanism such as gene conversion* is low ((2 X 10~!°) x (4X 107), and was 
not included. These assumptions are validated by our WGS data demonstrating 
single copies of all point mutations. 

Our identification of specific mutations that have a larger fitness effect in the 
tetraploid strains than in the haploid or diploid strains (Fig. 3) is consistent with 
the overall larger fitness effect of the tetraploids in our experiment. Note, however, 
that our parameters describe the entire distribution of mutations, rather than any 
specific mutation. Thus, it is also possible that some mutations could have a larger 
fitness effect in the haploids or diploids relative to the tetraploids. Note also that in 
our model clonal interference can occur, as there is no limitation on the number of 
independent mutations that can arise within a population. Thus multiple clones can 
emerge and compete with each other. As shown previously, clonal interference is an 
important aspect of microbial population dynamics and thus cannot be ignored.
Additionally, multiple mutations can exist in the same cell. 

As a sensitivity analysis, we also varied the ratio between advantageous and deleterious
mutations, and found that even when the fraction of deleterious mutations
is very large, their inclusion has only a negligible effect on the rate and dynamics
of adaptation. This is consistent with a large body of prior literature. To
accomplish this, we fixed the rate of the advantageous mutations and generated 
data sets with different ratios of beneficial to deleterious mutations: 100:0, 90:10, 
50:50, 10:90, and 1:99. For each ratio we generated 100 data sets (each data set with 
264 single deviation experiments) using a beneficial mutation rate of 1.2 X 10° 
and a fitness effect of s = 0.16 (the best-fit values of the delta function for the diploid
experiments, taken as a typical example; we used the same distribution as in ref. 17), and
fitted them against the simulations that we used to fit the empirical experiments 
(see below for the fitting procedure). The means of the fitted values are presented 
in Extended Data Fig. 3c. We found no significant difference between the different 
ratios of beneficial to deleterious mutations (t-test, P > 0.2 between all ratios tested).
Therefore, in our model, we assumed that half of all non-neutral mutations were 
deleterious and half were beneficial, and that the deleterious and beneficial muta- 
tions had the same fitness effect distribution with the same parameters. Note that 
we also obtained similar results using the equivalence principle model that did
not include deleterious mutations (see below). 

The upper bound of cellular fitness. For biological plausibility, it was necessary 
to set an upper bound for acquired fitness. On the basis of the well-described growth 
rates achievable for S. cerevisiae in optimal conditions, we set this boundary at a 
doubling time of 1 h (2²⁴ per day). Given that g is the number of generations a strain
experiences in a day, the upper bound of the fitness f of any ploidy type was therefore
set to satisfy the equation $f^{g} = 2^{24}$.
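
A small sketch of how such a cap could be applied, assuming that a 1 h doubling time corresponds to 24 doublings per day, so that the per-generation ceiling is 2 raised to the power 24/g. The function names are hypothetical.

```python
def max_fitness(generations_per_day, doubling_time_hours=1.0):
    """Largest per-generation fitness allowed by the fastest doubling time."""
    doublings_per_day = 24.0 / doubling_time_hours
    return 2.0 ** (doublings_per_day / generations_per_day)


def cap_fitness(fitness, generations_per_day):
    """Clamp a cell's fitness to the upper bound for its ploidy."""
    return min(fitness, max_fitness(generations_per_day))


# Rounded generations per day used in the simulations: 7 (1N), 6 (2N), 5 (4N).
for ploidy, g in (("1N", 7), ("2N", 6), ("4N", 5)):
    print(ploidy, round(max_fitness(g), 3))
```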

The initial growth rate of the tetraploid cells was lower than the growth rate of 
the haploids and diploids (see previous section entitled ‘Batch culture evolution
experiment’). In the simulations we rounded the number of generations to satisfy 
the assumption of non-overlapping generations in the Wright-Fisher model: the 
tetraploid cells underwent about five generations per day, whereas the haploid cells 
underwent around seven, and diploid cells around six generations per day. In the 
simulations, as in the experiments, we diluted the populations every day by choos- 
ing at random 1% of the haploid, 2% of the diploid, and 3.3% of the tetraploid cells. 
This dilution was done by using a hypergeometric random generator for popula- 
tions smaller than 100 million (http://www.agner.org/random/); for populations 
larger than 100 million this method is not applicable and we used a direct Bernoulli 
sampling of cells, forcing the total sampled cells to be 1%, 2%, or 3.3% for haploid,
diploid, and tetraploid populations, respectively. The concentration of each of the 
cell types was recorded in the simulation output at the end of every day, after dilution. 
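
The daily dilution step can be illustrated with sampling without replacement, under which the number of cells kept from each colour follows a hypergeometric distribution. This sketch uses Python's standard library (version 3.9 or later for the `counts` argument) rather than the random-number library cited above, and the function name and population sizes are hypothetical.

```python
import random

# Fraction of cells carried over each day, as stated in the text.
DILUTION_FRACTION = {"1N": 0.01, "2N": 0.02, "4N": 0.033}


def dilute(n_cfp, n_yfp, fraction, rng):
    """Keep a fixed fraction of cells, drawn at random without replacement."""
    n_keep = round(fraction * (n_cfp + n_yfp))
    kept = rng.sample(["CFP", "YFP"], n_keep, counts=[n_cfp, n_yfp])
    kept_cfp = kept.count("CFP")
    return kept_cfp, n_keep - kept_cfp


rng = random.Random(1)
cfp_after, yfp_after = dilute(6000, 4000, DILUTION_FRACTION["2N"], rng)
print(cfp_after, yfp_after)
```

Materialising every sampled cell is practical only for modest populations; for very large populations a per-cell Bernoulli draw, as described in the text, or a dedicated hypergeometric generator would be the more realistic choice.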

In the experiments with the diploid and tetraploid cells, we observed an initial 
small bias against the YFP-labelled population: in the diploid cells there was a decline 
in the YFP-labelled population of 0.5% per day, and in the tetraploid cells there 
was a 1% decline per day (Fig. 1b). We included these biases in the simulations by 
including this initial small difference in the fitness (that is, a reduced average number 
of offspring) of CFP- versus YFP-labelled cells.

Note that drift could not feasibly play a role in our experiment, as the timescale 
(in generations) for fixation of an allele due to drift is approximately equal to the 
effective population size**. This time was of the order of 10° generations in our 
experiments’’, much larger than the timeframe of our experiment (250 generations). 
Thus the simulations continued until one of the cell types overtook the whole population
or until the end of the time that the CFP:YFP data were collected on the
30th day; the first of either event terminated a simulation. To increase the efficiency 
of the simulations, extinction of a certain colour (and thus fixation of the other) 
was defined not by a value of zero frequency, but as a frequency of less than or equal
to 1%, as the probability was negligible that a sub-clone present at a frequency of 
1% would overtake another sub-clone present at a frequency of 99% in the timeframe
of our experiment. Furthermore, while the variability between flow cytometer
measurements was never greater than 1% of the measurement (see above, and
ref. 26), our ability to detect changes in CFP or YFP populations below a frequency 
of 1% was a limitation of the flow cytometer. 

Fitting procedure. The empirical data for each ploidy type were combined to 
represent the average deviation from an equal percentage of YFP and CFP cells. We 
then used the combined ploidy data to estimate the best-fit values of mutation rate 
(μ) and selection coefficient (s), using least squares fitting.

To compare data from the experiment and the simulations, we investigated several 
different summary statistics and used the one that performed best. The summary 
statistics evaluated were as follows. 

1. Mean deviation from an equal percentage of CFP and YFP cells: for every exper- 
iment corresponding to an individual well in a 96-well plate, we calculated the devi- 
ation from equal percentage every day. Then, for every data set generated (experiment 
and simulation) we calculated the average deviation per day. This procedure created 
a vector of the mean deviations per day for every data set. 

2. Mean and standard deviation of the deviation from an equal percentage of 
CFP and YFP: as in 1, except that we also calculated the standard deviation of the 
deviation from equal percentage for every day. This procedure generated two vectors 
for every data set. 

3. A distribution of the deviation from an equal percentage of CFP and YFP with 
ten bins: we calculated the absolute value of the deviation from equal percentage 
for each deviation experiment for each day. Then for every day, we binned the devi- 
ation values into ten bins, each with a size of 5%. That is, we counted the number of 
wells that had a deviation between 0-5%, 5-10%, ..., 45-50%. This procedure generated
a matrix of ten bins × 30 days for each data set.

4. A distribution of the deviation from equal percentage CFP and YFP with three 
bins: similar to (3), except we used only three bins, one for the non-deviated wells 
(defined as 0-0.1), one for fixated wells (defined as 0.4-0.5) and one for those in
the middle (0.1-0.4). 
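
The ten-bin summary statistic in item 3 can be sketched as a per-day histogram of absolute deviations from 50% YFP. The function names and example values are hypothetical, and the result is stored as one row of ten counts per day, which is the transpose of the ten bins × 30 days layout described above.

```python
def ten_bin_counts(percent_yfp_per_well, n_bins=10, bin_width=5.0):
    """Count wells by absolute deviation from 50% YFP, in 5%-wide bins."""
    counts = [0] * n_bins
    for pct in percent_yfp_per_well:
        deviation = abs(pct - 50.0)
        # A fully fixed well (deviation of exactly 50%) goes into the last bin.
        index = min(int(deviation // bin_width), n_bins - 1)
        counts[index] += 1
    return counts


def summary_matrix(daily_measurements):
    """One row of ten bin counts per day of measurements."""
    return [ten_bin_counts(day) for day in daily_measurements]


# Hypothetical %YFP values for five wells on a single day.
example_day = [50, 52, 57, 100, 0]
print(ten_bin_counts(example_day))
```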
The comparison between the experiments and the simulations was done by calculating
the sum of squares (SOS) between the summary statistic (SS) of the experiment
and the summary statistic of the simulations:


$$\mathrm{SOS}_{\mu,s} = \sum_{\Omega} \left( SS^{\mathrm{experiment}}(\Omega) - SS^{\mathrm{simulation}}(\Omega) \right)^{2}$$

where Ω spans the day and the number of values that each summary statistic has
each day. The best-fit pair was the set of μ and s values with the smallest sum of squares.
To determine the performance of the different summary statistics, we generated 1,000
data sets with the same parameters (1N, uniform distribution with s = 0.05, and 
L= 8*10 °), each containing 264 single deviation experiments, and inferred their 
values by scanning the parameter space for each of the 1,000 artificial data sets. The 
expected values from the simulations were then generated by 1,000 single deviation 
experiments for each set of parameters. 
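
A minimal sketch of the sum-of-squares comparison and the selection of the best-fit pair, assuming each summary statistic is stored as a list of per-day rows. The function names, the dictionary layout keyed by (mutation rate, selection coefficient), and the numbers are hypothetical.

```python
def sum_of_squares(ss_experiment, ss_simulation):
    """Sum of squared differences over all days and all values per day."""
    return sum(
        (e - s) ** 2
        for row_e, row_s in zip(ss_experiment, ss_simulation)
        for e, s in zip(row_e, row_s)
    )


def best_fit(ss_experiment, simulated):
    """Return the (mutation rate, selection coefficient) key with the smallest SOS."""
    return min(
        simulated,
        key=lambda params: sum_of_squares(ss_experiment, simulated[params]),
    )


# Hypothetical two-day, two-bin summaries for one experiment and two parameter pairs.
experiment = [[10, 0], [8, 2]]
simulations = {
    (1e-6, 0.05): [[10, 0], [9, 1]],
    (1e-5, 0.10): [[5, 5], [4, 6]],
}
print(best_fit(experiment, simulations))
```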

The distribution of inferred μ values is shown in Extended Data Fig. 3d. Whereas
all of the summary statistics had the mutation rate used in the simulations as their 
mode, the observed means varied modestly. The ten-bin summary statistic had the 
narrowest range, and therefore this was used for further analysis. A similar sum- 
mary statistic was also used previously”. 

The scanned range of mutation rates and fitness effects. The best-fit value was 
found by scanning a range of mutation rates and fitness effects; to this end, we
scanned the parameter space of the fitness effect in linear steps from 0.005 to 0.35, 
with increments of 0.005. The mutation rates were scanned in logarithmic steps 
from log₁₀(μ) = −8 to log₁₀(μ) = −4, with increments of 0.1. Thus, per ploidy per
fitness distribution, we scanned 2,460 parameter regimes, analysed 1,000 divergence 
simulations per μ and s combination, and scanned three ploidy types and three
different fitness distributions in total. 

For the exponential fitness distribution, larger mutation rates and smaller selec- 
tion coefficients needed to be investigated to find the best-fitting pair: we scanned 
mutation rates from log₁₀(μ) = −8 to log₁₀(μ) = −3, and selection coefficients from
0.002 to 0.1, with increments of 0.002. Additionally, in the exponential distribution, 
the majority of mutations had a fitness value that was too small to contribute to the 
competition between the two cell populations (YFP and CFP). However, increasing
the mutation rate would increase the probability that mutations with larger fitness
value were obtained. We found that for mutation rates higher than 10⁻⁴ per
genome per cell division, the computational time constraints were substantial; thus 
we excluded all mutations that had an s value smaller than 10% of the average fitness
effect s. Support for this choice was provided by the finding that for large mutation
rates (that is, Nμ > 1), the mutations chosen from an exponential distribution
that eventually reach fixation within the population had a larger fitness effect than 
the distribution average (ref. 20 and Fig. 3). The above criteria for large mutation 
rates were within our experimental regime (Nμ = 10⁵ × 10⁻⁴ > 1). Furthermore,
in Extended Data Fig. 3e we present simulation results of the average deviation 
from equal percentages of CFP and YFP for a given mutation rate and fitness effect 
drawn from an exponential distribution. We found that the results were robust to 
including or excluding those mutations with fitness effects smaller than the distribution
average. Thus, for high mutation rates (Nμ > 1), we could exclude weak
mutations, as was shown in ref. 20. 

The fitting of the empirical experiments and the simulations was done as described 
above. The results of the best-fit values are presented in Extended Data Fig. 2a-d. 

The equivalence principle model. As a complementary approach we evaluated 
the equivalence principle model developed in ref. 17 to analyse a similar competi- 
tion experiment in Escherichia coli. This study concluded that the mutations that 
eventually reach fixation in large microbial populations have a very narrow range 
of fitness values. Very weak mutations are unlikely to lead to a takeover of one pop- 
ulation (CFP or YFP) by the other, whereas very strong mutations are very infre- 
quent. On the basis of this prediction, the competition experiments were described 
by assuming that all beneficial mutations have exactly the same fitness value; that 
is, assuming that the distribution of fitness effects is a delta function. On the basis 
of this assumption, the authors developed a method for inferring the two key param- 
eters of the dynamics of the adaptation process: the mutation rate of a beneficial 
mutation and the single fitness effect value. Note that the assumption of a single 
fitness value for beneficial mutations was shown to be true only if the mutation rate 
is very high, which cannot be known a priori. This is why the branching evolution
model was implemented first and then compared with the equivalence principle 
model. Note also that the equivalence principle model analyses only the initial 
CFP:YFP divergence phase, rather than using the entire data set as in the branching
evolution model. Although this excludes some data, it has the advantage of not 
necessitating the assumption that the distribution of fitness effects is constant.

We followed the procedure outlined in ref. 17, including the following assump- 
tions and modelling steps (see the Supplementary Information in ref. 17 for a more 
complete description). 
1. Rather than the computationally intensive simulation of growth and dilution,
we used the effective population size. The effective population size is $N_e = N_0 \log(r)\,T$,
where $N_0$ is the initial population size, r is the growth rate of the population, and T
is the number of generations between dilutions. In addition to reducing computa- 
tion time, the use of the effective population size allows for an analytical approxi- 
mation of certain quantities, such as the probability of escaping drift. 

2. Even large mutations may be eliminated by drift, and only those that escape 
drift can contribute to the competition between the two labelled cell populations. 
The probability of emergence of a new mutation that will escape drift can be cal- 
culated analytically for a fixed population, making it possible to generate in simula- 
tions only those mutations that escaped drift, and contribute to the competitiveness 
of the population. Under the above simplifying approximations of (1) a fixed population
size and (2) a delta distribution of fitness effects, the probability that a mutation
will emerge and escape drift ($P_{\mathrm{escape}}$) was calculated analytically, as a function of
the mutation rate, μ (the rate for a new mutation to appear in a single cell division),
the selection coefficient, s, and the effective population size, $N_e$.

3. Given that $P_{\mathrm{escape}}$ can be calculated, simulations can be performed rapidly by
generating the time in the experiment when such mutations occur, rather than 
randomly assigning mutations at each cell division. 

We applied the equivalence principle model as follows. To compare the experiments
with the simulations, as in ref. 17, we used time and initial slope of CFP:YFP
ratio divergence as summary statistics for the divergence curves. Each CFP:YFP
divergence experiment (one single well from a 96-well plate) was fit to an exponential
growth model, with slope α that starts at a given time τ, using the expression
$-\log_{10}(1 + 0.5\exp(\alpha(t - \tau)))$. An exponential model is a good description of the
initial divergence phase. The end of the exponential growth phase was defined by the 
time that had the maximum likelihood to be described by exponential growth (see 
the Supplementary Information in ref. 17 for more details). The initial CFP:YFP
bias that we detected in some 2N and 4N populations was included in cases where 
it occurred. 

Each divergence experiment is thus described by its onset time τ and slope α, whether it is
a real experiment or a simulated one. Because a single divergence experiment is
subject to significant stochasticity, we compared the empirical experiments and
the simulations by combining all the wells of each ploidy type. This was done by
collecting all of the α values of a given experiment or a given set of simulations (that
is, those simulations that we generated by the same pair of μ and s values) into a
distribution of α values. The experimental distribution was compared with the
distribution of the simulations by a Kolmogorov-Smirnov test between the two. The
same was done for the τ values. For each pair of s and μ values, we calculated the
sum of the Kolmogorov-Smirnov test of its α distribution and the Kolmogorov-Smirnov
test of its τ distribution. The pair of values (μ, s) that had the smallest sum
was declared the best fit (Extended Data Fig. 2a). This was done independently for
each ploidy type (Extended Data Fig. 2e).
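
The comparison of slope and onset-time distributions can be sketched as below. It assumes that the quantity being summed is the two-sample Kolmogorov-Smirnov statistic (the largest gap between the two empirical cumulative distributions), which the text does not state explicitly. The function names and the sample values are hypothetical.

```python
def ks_statistic(sample_a, sample_b):
    """Two-sample Kolmogorov-Smirnov statistic: largest gap between empirical CDFs."""
    a, b = sorted(sample_a), sorted(sample_b)
    i = j = 0
    largest_gap = 0.0
    while i < len(a) and j < len(b):
        x = min(a[i], b[j])
        # Advance both samples past every value less than or equal to x.
        while i < len(a) and a[i] <= x:
            i += 1
        while j < len(b) and b[j] <= x:
            j += 1
        largest_gap = max(largest_gap, abs(i / len(a) - j / len(b)))
    return largest_gap


def combined_score(exp_slopes, exp_onsets, sim_slopes, sim_onsets):
    """Sum of the KS statistics for the slope and onset-time distributions."""
    return ks_statistic(exp_slopes, sim_slopes) + ks_statistic(exp_onsets, sim_onsets)


# Hypothetical slopes and onset times for an experiment and one simulated parameter pair.
score = combined_score([0.1, 0.2, 0.3], [40, 60, 80], [0.1, 0.2, 0.3], [45, 65, 85])
print(score)
```

The parameter pair with the smallest combined score across the scanned grid would then be taken as the best fit.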

Modelling results. The fitting of the empirical experiments and the simulations 
was done as described above. The results of the best-fit values from the branching 
evolution model and the equivalence principle model are presented in Extended 
Data Fig. 2a. To generate the error range for each distribution and ploidy type, we 
used the parametric bootstrapping method. For each estimated set of values, we 
generated 1,000 simulated data sets and compared those with the empirical data 
sets (data set sizes were 172, 264, and 265 independent deviation experiments for 
the haploids, diploids, and tetraploids, respectively). We then inferred the best-fit 
values of those data sets. The 95% confidence intervals of μ and s from those 1,000
data sets were defined as the error ranges (Extended Data Fig. 2a-e). 
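
One simple way to turn the 1,000 bootstrap estimates into a 95% interval is to take the central percentiles of the sorted values. The sketch below assumes this percentile approach and a nearest-rank rule for picking the cut-offs; the source does not specify how the interval was computed, and the function name is hypothetical.

```python
def percentile_interval(estimates, level=0.95):
    """Central interval covering `level` of the bootstrap estimates (nearest rank)."""
    ordered = sorted(estimates)
    n = len(ordered)
    tail = (1.0 - level) / 2.0
    lower_index = round(tail * (n - 1))
    upper_index = round((1.0 - tail) * (n - 1))
    return ordered[lower_index], ordered[upper_index]


# Hypothetical stand-in for 1,000 best-fit values re-inferred from simulated data sets.
bootstrap_values = list(range(1000))
lower, upper = percentile_interval(bootstrap_values)
print(lower, upper)
```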

We infer from these results that the tetraploids have a higher mutation rate and 
that these mutations have, on average, a stronger fitness effect compared with haploids
or diploids. The trend for tetraploids to have higher μ and s values occurs independently
of the different assumed distributions of fitness effects. However, the
different distributions lead to significantly different absolute values. This is expected 
from the characteristic shape of these distributions. To illustrate why this result is 
expected, we show a schematic diagram of the three distributions of fitness effects 
(Extended Data Fig. 2f). 

The mutations that mainly govern the adaptation process are hypothesized to 
come from a relatively narrow range of fitness effect values that are sufficiently 
strong, but not extremely rare (Extended Data Fig. 2f, double arrow region from 
the idealized narrow Gaussian curve). The three assumed distributions that we used 
in our modelling approximate the true distribution but with the following differ- 
ences. The delta distribution is located in the centre of the double arrow region. 
Thus, by definition, every mutation from the delta distribution will have a fitness 
effect that is strong enough to have a significant probability of promoting adapta- 
tion. If every mutation has the possibility of contributing, then a lower mutation 
rate suffices for adaptation. By contrast, the exponential distribution is dominated 
by small-effect or near-neutral mutations, with a relatively low fraction of mutations 
near the value for the delta distribution. Thus, the assumption of an exponential 
distribution is accompanied by a requirement for a compensatory higher mutation
rate to achieve numbers of equivalently beneficial mutations. By the same line
of reasoning, the uniform distribution necessitates a rate of beneficial mutations 
that is intermediate between the exponential and delta distributions. Indeed, our 
results match all of these expectations (Extended Data Fig. 2a-d). A similar difference
in the estimation of μ was also observed in another recent study that modelled
this value based on these three assumptions about the distribution of beneficial
mutations.


In terms of the fitness effect values of the different distributions, the uniform 
distribution is governed by its upper limit, which, by definition, is the strongest muta- 
tion allowed. Therefore, it is expected that the uniform distribution mean, which is 
half of the upper limit, will be half of the fitness effect value of the delta function. 
Again, this is what we observed (Extended Data Fig. 2a-d). The exponential distribution
mean is much smaller than the strongest mutation that can be generated by
the exponential distribution, and the best-fit parameters are expected to be much 
smaller than the other two distributions, which is also observed (Extended Data 
Fig. 2a-d). Furthermore, because the exponential distribution has no upper bound 
on the fitness effects, a larger mutation rate can lead to the emergence of mutations 
with much larger fitness effects. In this way, under the exponential distribution 
assumption, the mutation rate affects the range of possible fitness effects, which 
results in a larger error range for the exponential distribution compared with the 
other distributions (Extended Data Fig. 2a, brackets). 

In summary, despite expected differences in absolute values, we reach the same 
overall conclusions with either the branching evolution model with varied assump- 
tions about the distribution of beneficial mutations or the equivalence principle 
model.

Computer code availability. The computer code is available as a Supplementary 
Software file. The code was compiled with g++ (version 4.2.1) and was tested on Unix 
(CentOS5 operating system) and Mac (OS X version 10.9.2) machines. 

Plasmid construction. All plasmids used in this study are listed in Extended Data 
Table 1. To construct plasmids for the inducible expression of either CFP or YFP, 
the galactose-inducible GAL1 promoter was subcloned into the YFP plasmid PB1500 
and the CFP plasmid PB2452. These plasmids were derived from the green fluorescent 
protein (GFP) tagging plasmid generated in ref. 44. Both plasmids contained 
the ADH gene terminator (tADH) after the YFP or CFP gene and the sequence of the 
SpHIS5 gene of Schizosaccharomyces pombe as a selectable marker. Plasmids PB1500 
and PB2452 were digested with BamHI and PacI to introduce the pGAL promoter, 
461 base pairs (bp) upstream of the start codon of GAL1 (ref. 45), which was 
amplified using the primers pGAL1 BamHI 5’ (5’-ACGGATCCCCGGGTTGAAGTACGGATTAGAAGCCGCCGAG-3’) 
and pGAL1 PacI 3’ (5’-CGTTAATTAATATAGTTTTTTCTCCTTGACGTTAAAG-3’). Site-directed 
mutagenesis (QuikChange Mutagenesis Kit, Stratagene) was used to introduce an ATG translation 
start codon to the YFP and CFP genes (using the GAPATGpFA6 primer, 
5’-CAATCAATCAATCAATCATCACATAAATTAATTAAATGAGTAAAGGAGAAGAACTTTTCACTGGAGTTGTC-3’). 
The resulting plasmids PB2694 and PB2697 contained the cassettes 
pGAL1-CFP-tADH-SpHIS5 and pGAL1-YFP-tADH-SpHIS5, respectively. 

PB2314 was used to delete the MAT locus as previously described’*. PB1308 was 
used to perform a URA3 to TRP1 marker swap, as previously described**. PB1640 
(hphMX4, ref. 47) was used for PCR-mediated deletion of STE4. PB2647 (STE4- 
LEU2) was used to restore mating competency and was constructed by amplifying 
the STE4 gene with primers STE4 P BamHI 5’ (5'-CCGGATTCTTGTAGCCCTG 
TTAGGTTTACC-3’) and STE4 T BamHI 3’ (5'-CCGGATTCCAATACATAAG 
GACGAGCCAGTG-3’), and cloning it into pRS315. PB2649 (STE4 URA3 CEN 
MATα) was also used to restore mating competency, and was constructed by subcloning 
the STE4 fragment from PB2647 (digested with SmaI and NotI) into PB2577 
(MATa URA3 CEN, digested with SmaI and NotI). 

Yeast strain construction. All S. cerevisiae strains used in this study were isogenic 
to PY3295 (BY4741, S288c genetic background MATa his3Δ leu2Δ met15Δ ura3Δ) 
and are listed in Extended Data Table 1. The strategy used to generate the isogenic 
ploidy series is illustrated in Extended Data Fig. 1 and genotypes of key intermedi- 
ates are indicated. The CFP and YFP ancestors were derived from the haploid strain 
PY5997 (matΔ::pSTE5-ura3::TRP1, ste4Δ::HygroR, trp1::NatR, strain construction 
details available upon request). Isogenic strains with either the CFP or YFP cassettes 
at the TRP1 locus (chromosome IV) were generated (PY5998 and PY5999) as 
follows: the pGAL1-CFP-tADH-SpHIS5 or pGAL1-YFP-tADH-SpHIS5 cassette 
was PCR amplified from plasmid PB2694 (CFP) or PB2697 (YFP), respectively, 
with primers delTRPGFP5’ (5’-TATTGAGCACGTGAGTATACGTGATTAAG 
CACACAAAGGCAGCTTGGAGTGCAGGTCGACGGATCCCCGGG-3’) and 
delTRPGFP3’ (5’- GAACGTGCACTGAGTAGTATGTTGCAGTCTTTTGGAA 
ATACGAGTCGAATTCGAGCTCGTTTAAAC-3’) and transformed into PY5997 
at the TRP1 locus. The haploid ancestor strains expressing CFP (PY5998) or YFP 
(PY5999) were confirmed by PCR and fluorescence microscopy. The haploid ancestors 
were modified to become mating competent by transformation with plasmids PB2647 
(LEU2-STE4) and PB2649 (URA3-STE4-MATα). Diploid zygotes were selected on 
-Ura -Leu plates, and then colony purified on YPD plates to allow plasmid loss. 
Diploid chromosome content was confirmed by flow cytometry and aCGH, and 
strains PY6006, PY6008, PY6014, and PY6022 were selected. The diploid ances- 
tors were made mating competent by transformation with plasmids PB2647 and 
PB2649. Tetraploid zygotes were pulled onto YPD plates using a micromanipu- 
lator, and after 2 days growth at 30 °C the ploidy of each zygote was determined by 
flow cytometry and aCGH. 

The SNF3-G439E mutation was constructed in the haploid YFP strain back- 
ground (PY5999) using the pCORE counter-selectable reporter system, a gift from 
M. Resnick. Primers SNF3_pCORE_KAN (5'-TGTTGGGGGTGTTATCATGA 
CTATAGCCAACTTTATTGTGGCCATTGTTGGGAGCTCGTTTTCGACAC 
TGG-3') and SNF3_pCORE_URA (5'-TATAAATGCTATCATAACTTTTGCG 
GCCGCTACAGTCTTTAAGGAACACTCCTTACCATTAAGTTGATC-3’) were 
designed to integrate the CORE sequence at the SNF3 locus; PCR amplification and 
transformation procedures were followed as detailed previously”. Sanger sequenc- 
ing was used to identify clones with the desired mutation (chromosome IV: 112,896 
G>A). Diploid SNF3-G439E mutants (heterozygous SNF3-G439E/SNF3 and homo- 
zygous SNF3-G439E/SNF3-G439E clones) were constructed by mating after intro- 
duction of plasmids to confer mating competence (PB2649 or PB2647), as described 
above for the construction of the CFP- and YFP-marked strains. An analogous 
strategy was used to generate tetraploid SNF3-G439E strains (heterozygous SNF3- 
G439E/SNF3/SNF3/SNF3). 

The chromosome XIII aneuploid strain series was constructed in the S288c back- 
ground from the diploid strain PY7295 (RL4737) and the diploid PY7296 (RL4888), 
which is trisomic for chromosome XIII°°. PY7296 was isolated from a triploid meiosis 
and a minimal number of cell divisions. We confirmed the chromosome XIII 
trisomy by aCGH (Extended Data Fig. 9). We generated tetraploid clones by mat- 
ing PY7295 to PY7296, with changes in mating-type accomplished as described 
previously’*. Tetraploid clones were isolated on selective media and analysed by 
flow cytometry and aCGH (representative clones shown in Extended Data Fig. 9). Additional 
details for all yeast strain constructions are available upon request. 

Relative fitness assays. Competitive fitness assays were performed using single 
colony isolates from the evolved populations and a common ancestor. One single 
colony was isolated from frozen stocks of each well of the evolution experiments 
(1N(A), 1N(B), 2N(A), 2N(B), 4N(A), 4N(B), 4N(C)) at generation 250. The evolved 
clones were cultured for 24 h in 500 µl of Synthetic Complete + 2% raffinose, diluted 
into fresh medium, and competed with the ancestor expressing the complementary 
fluorescent protein. Competitions were initially performed using approximately 
the same number of cells from the ancestor and the evolved clone, but because the 
evolved clones grew significantly faster than the ancestor strains, the competitions 
were repeated using approximately five times more ancestor cells than evolved 
clone cells, with an initial population size of 1 X 10°. Serial dilutions were performed 
each day and the YFP:CFP ratio was determined by flow cytometry, yielding 
an estimate of the number of evolved cells (N_evolved) relative to the ancestors (N_ancestor) 
as a function of time. The data were analysed in Matlab using a custom script that 
performed a linear least-squares fit of log(N_evolved/N_ancestor) over multiple dilution cycles. 
The fitness relative to the ancestor is defined as s = d/dt [log(N_evolved/N_ancestor)], where t is 
measured in days’’. 
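
The slope estimate described above can be sketched as follows. This is a minimal illustration rather than the authors' Matlab script: the function name is hypothetical, and the use of the natural logarithm is an assumption, because the base of the logarithm is not stated here.

```python
import math


def relative_fitness(days, n_evolved, n_ancestor):
    """Estimate s as the least-squares slope of log(N_evolved / N_ancestor)
    against time in days.

    Assumption: natural logarithm. A different base rescales s by a constant.
    """
    if not (len(days) == len(n_evolved) == len(n_ancestor)):
        raise ValueError("inputs must have the same length")
    if len(days) < 2:
        raise ValueError("at least two time points are required")

    log_ratio = [math.log(e / a) for e, a in zip(n_evolved, n_ancestor)]
    mean_t = sum(days) / len(days)
    mean_y = sum(log_ratio) / len(log_ratio)

    # Ordinary least-squares slope: Sxy / Sxx.
    sxx = sum((t - mean_t) ** 2 for t in days)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(days, log_ratio))
    if sxx == 0.0:
        raise ValueError("time points must not all be identical")
    return sxy / sxx


if __name__ == "__main__":
    # Synthetic example: the evolved:ancestor ratio starts at 1:5 and grows
    # exponentially with a made-up rate of 0.3 per day.
    days = [0, 1, 2, 3]
    evolved = [0.2 * math.exp(0.3 * t) for t in days]
    ancestor = [1.0] * len(days)
    print(relative_fitness(days, evolved, ancestor))
```

Only the ratio of evolved to ancestor cells enters the fit, so YFP:CFP ratios from flow cytometry can be passed directly with the ancestor counts set to 1.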

Flow cytometry analysis of DNA content. Cells were prepared for propidium iodide 
staining as described but with modifications to optimize preparation of samples 
in 96-well plates. Thirty thousand cells were analysed using a BD LSRII High 
Throughput Sampler. FlowJo cell cycle analysis used the Dean-Jett-Fox model to 
estimate the mean G1 and G2 fluorescence peaks of each strain. Control parental 
1N, 2N, 3N, and 4N strains were analysed in triplicate with the evolved strains. 

Microarray aCGH. Fluorescently labelled DNA was prepared for CGH as described 
previously*’. Genomic DNA from all experimental strains was compared with the 
same pool of genomic DNA from the ancestral strain background PY3295 (BY4741, 
Research Genetics). Agilent yeast DNA 4×44K microarrays (ChIP-on-chip Kit) 
were used for the hybridization according to the manufacturer’s instructions (Agilent 
Technologies) with several modifications (M. Dunham online protocols, 
http://dunham.gs.washington.edu/protocols.shtml). Briefly, 2.0 µg of HaeIII-digested (New 
England Biolabs) genomic DNA was labelled with 2.1 µl of Cy3 or Cy5 (CyDye-Cy3-dUTP 
or CyDye-Cy5-dUTP, Amersham GE Healthcare). Three hundred 
nanograms of Cy3-labelled DNA (experimental strains) was mixed with 300 ng of 
Cy5-labelled DNA (control DNA) and the volume was brought to 44 µl with nuclease-free 
water. Blocking buffer and hybridization buffer 2× HiRPM (Agilent Technologies) 
were added, and 100 µl was applied to each sub-array; the microarray was 
hybridized at 65 °C for 17 h and then washed, scanned, and analysed according to 
the manufacturer’s instructions. Agilent Feature Extraction data were converted from 
log₁₀ ratios to log₂ ratios and plotted using Treeview” and a custom Matlab script. 
A log₂ ratio of zero (baseline) indicates no difference in DNA copy number between 
reference and experimental samples'**’. 
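
The base conversion mentioned above is a change-of-base identity, log₂(x) = log₁₀(x) / log₁₀(2). A minimal sketch follows; the function names are illustrative and are not part of the Agilent software or the authors' script.

```python
import math


def log10_to_log2(log10_ratio):
    """Convert a log10 ratio to a log2 ratio (change of base)."""
    return log10_ratio / math.log10(2.0)


def copy_ratio(log2_ratio):
    """Experimental:reference DNA copy ratio implied by a log2 ratio.
    A log2 ratio of zero corresponds to a ratio of 1 (no difference)."""
    return 2.0 ** log2_ratio


if __name__ == "__main__":
    value = log10_to_log2(math.log10(2.0))  # a twofold gain on the log10 scale
    print(value, copy_ratio(value))
```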

Quantitative PCR. All qPCRs were performed on an Applied Biosystems ViiA-7 
real-time PCR machine in 96-well format with Power SYBR Green PCR Master Mix 
(Applied Biosystems) and three technical replicates. HXT6/7 gene copy number was 
determined relative to the ancestor as previously described**. Genomic DNA was 
isolated and RNase treated from 30 clones with the highest fitness from each hap- 
loid, diploid, and tetraploid experiment. HXT6/7 on chromosome IVR was amp- 
lified using forward primer 5’-GATTATTGCTGGTCCGATCC-3’ and reverse 
primer 5’-GAGTAATCGCCAATGGGTCT-3’; the control locus UBP1 on chromosome 
IVL was amplified using forward primer 5’-GCGCTCTGTCATTGTTCACT-3’ 
and reverse primer 5’-GACTTTCAGCTTCGTCCACAA-3’. Raw HXT6/7 
values were normalized to UBP1 for each clone and then normalized to the ancestor 
(ΔΔCt). We found the amplification in 3% of the 1N-, 30% of the 2N-, and 43% 
of the 4N-evolved clones (n = 30). This significant bias for HXT6/7 amplification 
in the 2N and 4N populations (t-test, P= 0.005 and P= 1 X 10°“, respectively) 
might have been due to mutations in 1N cells that prevented the acquisition of the 
HXT6/7 amplification because of negative epistasis”*. 
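
The two-step normalization (to the control locus, then to the ancestor) can be sketched as below. This is an illustrative sketch, not the authors' analysis: the function names are hypothetical, and converting ΔΔCt to a relative copy number as 2^(−ΔΔCt) assumes roughly 100% amplification efficiency for both primer pairs.

```python
from statistics import mean


def delta_delta_ct(clone_target, clone_control, ancestor_target, ancestor_control):
    """ΔΔCt from lists of technical-replicate Ct values.

    ΔCt  = mean Ct(target locus) - mean Ct(control locus), per strain
    ΔΔCt = ΔCt(clone) - ΔCt(ancestor)
    """
    delta_clone = mean(clone_target) - mean(clone_control)
    delta_ancestor = mean(ancestor_target) - mean(ancestor_control)
    return delta_clone - delta_ancestor


def relative_copy_number(ddct):
    """Copy number relative to the ancestor, assuming the signal doubles
    every cycle (an assumption about primer efficiency)."""
    return 2.0 ** (-ddct)


if __name__ == "__main__":
    # Made-up Ct values, three technical replicates each.
    ddct = delta_delta_ct(
        clone_target=[20.0, 20.1, 19.9],
        clone_control=[18.0, 18.0, 18.0],
        ancestor_target=[21.0, 21.0, 21.0],
        ancestor_control=[18.0, 18.0, 18.0],
    )
    print(ddct, relative_copy_number(ddct))
```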

SNF3 gain-of-function mutations were previously shown to increase expression 
of HXT4 (ref. 25). Therefore we analysed HXT4 gene expression levels in diploid- 
evolved clone 2N_233 (carrying the SNF3-G439E mutation), relative to the diploid 
ancestor (PY6006). Strains were grown up from −80 °C stocks overnight in 5 ml 
Synthetic Complete + 2% raffinose and then diluted into fresh 25 ml Synthetic Com- 
plete + 2% raffinose. Cells were cultured at early log phase and RNA was extracted 
using an RNeasy Mini Kit (Qiagen). Complementary DNA was prepared using a 
SuperScript III First-Strand synthesis system (Life Technologies). HXT4 was amp- 
lified using primers 5'-TAAGGTCAGCGCAGACGATCCA-3’ and 5’-TTCACC 
CCAGGAGGCATTACCA-3’; ACT1 was amplified using primers 5’-ACGTCGC 
CTTGGACTTCGAACA-3’ and 5’-TGGAACAAAGCTTCTGGGGCTC-3’. Raw 
qPCR values were normalized to ACT1 levels and then normalized to the 2N ancestor 
(PY6006). Relative to the ancestor, the clone bearing SNF3-G439E had eightfold 
higher HXT4 expression in raffinose medium. 

Overview of WGS. We performed WGS and identified de novo variants for 74 
evolved clones and two ancestors. Initially six evolved clones and one tetraploid 
ancestor were sequenced on ABI’s SOLiD 4 platform. Subsequent sequencing of 68 
evolved clones and the haploid ancestor was performed on an Illumina HiSeq 2500. 
The specifics of the analysis pipeline for each sequencing platform are provided below. 
Regardless of the underlying platform, the overall analysis strategy was as follows. 
Briefly, the raw reads underwent quality analysis and barcode/adaptor removal. 
High-quality reads were mapped to the S. cerevisiae reference genome (downloaded 
June 2010). Reads containing PCR-based artefacts were removed and alignments 
underwent local realignment around insertions and deletions (indels), resulting in 
the highest quality alignment. SNPs and indels were called and combined across 
the evolved strains and within the parental strains to identify a set of variants in the 
strain background relative to the reference. Each evolved strain (all 74) was indivi- 
dually compared with the parental set to identify the set of potential de novo vari- 
ants. These evolved strain calls were filtered by quality metrics and manually inspected. 
All variants of moderate or poor quality as well as a few good quality variants were 
analysed by Sanger sequencing. Chromosomal aneuploidy was inferred from changes 
in read depth using a windowing approach. 

Per cell, the evolved tetraploids had more mutations than haploids or diploids. 
However, per haploid genome, the evolved tetraploids on average accumulated a 
similar number of mutations (1.50 average SNPs per haploid genome, on the basis of 
final evolved ploidy) as the evolved diploids (1.44 average SNPs per haploid genome) 
and fewer mutations than the evolved haploids (2.05 average SNPs per haploid 
genome); however, neither comparison was significant. It is likely that the number 
of mutations in the evolved tetraploids was underestimated because of the high rate 
of chromosome loss in these strains. Interestingly, there was a higher average for 
the tetraploid-evolved clones that became ~2N in ploidy (2.00 average SNPs per 
haploid genome) than the diploid-evolved clones (1.44 average SNPs per haploid 
genome), suggesting that near-diploid cells that underwent a tetraploid intermedi- 
ate might have acquired more mutations than cells that remained diploid through- 
out the experiment. However, despite the trend, this effect did not reach statistical 
significance (P = 0.22). 

Illumina sequencing (2 × 100). Library preparation. Clones selected for WGS were 
cultured overnight from −80 °C stocks in 2 ml Synthetic Complete + 2% raffinose 
medium. Genomic DNA was isolated using phenol-chloroform-isoamylalcohol 
(24:25:1) and bead beating. Libraries were prepared as described™. Briefly, DNA 
was sheared with a Diagenode Bioruptor (UCD-200) to a median size of 300-500 bp, 
end-repair was performed with an NEB Next End repair kit (NEB E6050L), and frag- 
ments were A-tailed with Klenow fragment (M0212L). Custom adaptors with in-line 
barcodes were ligated overnight. Adaptor ligated fragments were size selected on 1% 
TBE agarose gel stained with Sybr Gold (Invitrogen S-11494) for fragments between 
400 and 600 bp and isolated using Qiagen Gel Extraction Kit (28706). Libraries 
were amplified for 12 cycles with Illumina PE PCR primers 1.0 and 2.0. Libraries 
were pooled and underwent additional size selection for fragments of 400-600 bp. 

Raw data. The genomes were sequenced on an Illumina HiSeq 2500 at the Uni- 
versity of Colorado at Denver Next Generation Sequencing Facility. The data, which 
had an inline barcode, were de-multiplexed by the sequencing facility into individual 
sample R1/R2 files—one file for each read in the pair. The barcodes were removed 
before mapping using Fastx_trimmer (version 0.0.13.2, http://hannonlab.cshl.edu/ 
fastx_toolkit/). Read trimming from the 5’ end of the R2 reads was performed in a 
sample-specific manner, trimming anywhere from 0 to 28 base pairs using an in-house 
script and Fastx_trimmer. 

Mapping. Reads were mapped to the S. cerevisiae reference genome for the laboratory 
yeast strain S288c (genome obtained 28 July 2010 from the Saccharomyces Genome 
Database FTP site: http://downloads.yeastgenome.org/sequence/S288C_reference/genome_releases/; 
actual genome: http://downloads.yeastgenome.org/sequence/S288C_reference/genome_releases/S288C_reference_genome_R63-1-1_20100105.tgz). 
The reads were mapped using the Bowtie2 (version 2.0.2)” local alignment strategy, 
allowing for multiple mapping, and setting the following options: 
--very-sensitive-local -I 180 -X 1000 --score-min G,70,8. The 
mapped reads then underwent file format conversion into the binary format for 
downstream analysis using Samtools view, sort, and index (version 0.1.18)°°. 
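
For orientation, the listed alignment options can be assembled into a paired-end Bowtie2 invocation as sketched below. Only the four options quoted in the text come from the source; the index prefix, file names and function name are hypothetical, the `-x`, `-1`, `-2` and `-S` arguments are the standard Bowtie2 input and output arguments, and the setting used to allow multiple mapping is not given in the text, so it is omitted.

```python
def bowtie2_command(index_prefix, reads_r1, reads_r2, out_sam):
    """Build (but do not run) an argument list for a paired-end Bowtie2
    local alignment using the options quoted in the Methods."""
    return [
        "bowtie2",
        "--very-sensitive-local",   # local alignment preset
        "-I", "180",                # minimum fragment length
        "-X", "1000",               # maximum fragment length
        "--score-min", "G,70,8",    # minimum-score function
        "-x", index_prefix,         # hypothetical index prefix
        "-1", reads_r1,             # R1 reads (hypothetical file name)
        "-2", reads_r2,             # R2 reads (hypothetical file name)
        "-S", out_sam,              # SAM output (hypothetical file name)
    ]


if __name__ == "__main__":
    command = bowtie2_command("S288C_R63", "clone_R1.fastq", "clone_R2.fastq", "clone.sam")
    print(" ".join(command))
```

Building the command as a list keeps each option and its value separate, so it can be passed to `subprocess.run` without shell quoting.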

Alignment tailoring. Post-alignment to the genome, duplicate pairs resulting 
from PCR overamplification were removed using Samtools rmdup, eliminating 
1-5% of the paired reads. The reads were re-aligned over potential indel sites using 
the Genome Analysis Toolkit RealignerTargetCreator and IndelRealigner (version 
2.4-9)57,58. 

Variant calling and refinement. Variant calling was performed on the tailored read 
mappings using Genome Analysis Toolkit UnifiedGenotyper (version 2.4-9)°”"*. 
For the haploids and diploids, SNPs were called using default parameters; for the 
higher ploidy strains, the ploidy option was increased to 5n, which allowed iden- 
tification of mutations at allelic frequencies down to 5% alternative allele repres- 
entation. Variant lists were combined on the basis of ploidy-type using Genome 
Analysis Toolkit CombineVariants. SNPs and short indels were compared with the 
parental set of mutations using in-house scripts to generate a set of non-Parental 
mutations. These mutations were filtered for alternative allele support and allelic 
frequency (more than two reads supporting an alternative allele for coverage of 10× 
to 20×, and more than four reads supporting an alternative allele for coverage of more 
than 20×). The filtered mutations were manually inspected using the Integrative 
Genomics Viewer (version 2.1.19)” to refine the set and further remove mapping 
artefacts such as strand representation bias, regional mapping quality issues from 
non-unique mapping, and artefacts of homopolymer and simple repeat alignments. 
We Sanger sequenced variants with low read support (fewer than five reads support- 
ing an alternative allele), as well as a subset of the other medium- and high-confidence 
variants. The final set of evolved variants discovered was annotated versus the gene 
file specific to the genome using an in-house script. 
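
The read-support thresholds described above can be expressed as a small predicate. This is a sketch of the stated rule, not the in-house script: the function name is hypothetical, and rejecting sites with coverage below 10× is an assumption, because the text does not state how such sites were treated.

```python
def passes_alt_support(coverage, alt_reads):
    """Return True if a candidate variant has enough alternative-allele reads.

    Rule from the text:
      coverage 10x to 20x -> more than two supporting reads
      coverage above 20x  -> more than four supporting reads
    Assumption: sites with coverage below 10x are rejected.
    """
    if coverage < 10:
        return False
    if coverage <= 20:
        return alt_reads > 2
    return alt_reads > 4


if __name__ == "__main__":
    candidates = [(15, 3), (15, 2), (30, 4), (30, 5), (5, 5)]
    for coverage, alt_reads in candidates:
        print(coverage, alt_reads, passes_alt_support(coverage, alt_reads))
```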

Chromosomal CNV identification. Identification of chromosomal CNVs used 
HTSeq (version 0.6.1)* in conjunction with custom scripts. HTSeq performs cov- 
erage estimations on a per-gene basis, and the custom scripts provided normalized 
log₂ fold change between each sample and the parental haploid strain. Estimates 
of chromosomal copy number were inferred using the median value for the log₂ fold 
change on a chromosome-by-chromosome basis. We implemented the Cochran- 
Armitage test to determine whether chromosome XIII had a trend for higher copy 
number, relative to the copy number observed for all chromosomes in the tetraploid- 
evolved clones (Fig. 2d and Supplementary Table 2). This trend analysis is similar to 
a ¢ test, but tests whether there is a significant trend or direction to the observed data 
set (chromosome XIII copy number). 
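
The per-chromosome summary described above can be sketched as follows. This is an illustration under stated assumptions rather than the custom scripts: the function name and input layout are hypothetical, and normalizing each gene's coverage by the genome-wide total is one simple normalization choice that may differ from the one actually used.

```python
import math
from collections import defaultdict
from statistics import median


def chromosome_log2_fold_change(sample_cov, parent_cov, gene_to_chrom):
    """Median log2 fold change (sample vs parent) for each chromosome.

    sample_cov, parent_cov : dict gene -> per-gene coverage
    gene_to_chrom          : dict gene -> chromosome name
    Coverage is scaled by the genome-wide total, so values are relative to
    the sample's average copy number (an assumed normalization).
    """
    sample_total = sum(sample_cov.values())
    parent_total = sum(parent_cov.values())
    per_chrom = defaultdict(list)
    for gene, chrom in gene_to_chrom.items():
        s = sample_cov[gene] / sample_total
        p = parent_cov[gene] / parent_total
        if s > 0 and p > 0:  # skip genes with no coverage in either strain
            per_chrom[chrom].append(math.log2(s / p))
    return {chrom: median(values) for chrom, values in per_chrom.items()}


if __name__ == "__main__":
    # Toy data: chromosome "B" has twice the coverage of chromosome "A".
    genes = {"g1": "A", "g2": "A", "g3": "A", "g4": "B", "g5": "B", "g6": "B"}
    parent = {g: 100.0 for g in genes}
    sample = {g: (200.0 if c == "B" else 100.0) for g, c in genes.items()}
    print(chromosome_log2_fold_change(sample, parent, genes))
```

Using the median rather than the mean limits the influence of individual amplified or deleted genes on the chromosome-level estimate.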

Sequencing quality assessment. Because our sequencing was highly multiplexed, 
quality assessment on the sequencing data was necessary to eliminate strains with- 
out adequate genome coverage. For the haploids and diploids, we determined the 
adequate depth of coverage to recover mutations in two ways. First, we took the set 
of ‘strain-background’ mutations, which were identified by filtering the parental 
variant calls for a conservative, high-quality (quality > 100), homozygous set of 
locations. Each strain was then queried for the ability to recapitulate these variants, 
reporting a percentage overlap between each strain’s variant calls and the set of 
background variants. Any strain with less than 97% of the background mutations 
was dropped from consideration. Additionally, we examined the impact of subsam- 
pling down to various depths to investigate the impact of lower coverage on reco- 
vering variants. This was done using Picard’s DownsampleSam jar (version 1.72, 
http://broadinstitute.github.io/picard/) on two higher-coverage diploid strains to 
randomly down-sample the coverage to 100×, 50×, 25×, and 10×. We 
examined the SNP call overlap and found that strain-unique SNPs could 
be captured even at 10× coverage. Using this information, we set minimum 
coverage requirements for each strain on a genome-wide scale to eliminate strains 
without adequate genomic representation. We analysed depth of coverage on all 
of the mapped data with BEDTools genomeCoverageBed (version 2.16.2)°'. The 
per-base coverage was then analysed using an in-house script to produce statistics 
on minimum coverage per allele, average coverage, etc. 
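
The first quality check, recovery of the strain-background variants, can be sketched as a set overlap. The function names and the representation of a variant as a (chromosome, position, alternative allele) tuple are illustrative assumptions; the 97% cut-off is the value given in the text.

```python
def background_recovery(strain_calls, background_calls):
    """Percentage of background variants that are also called in a strain.

    Each variant is a hashable key, here (chromosome, position, alt allele).
    """
    background = set(background_calls)
    if not background:
        raise ValueError("background variant set is empty")
    recovered = background & set(strain_calls)
    return 100.0 * len(recovered) / len(background)


def has_adequate_coverage(strain_calls, background_calls, min_percent=97.0):
    """Keep a strain only if it recovers at least min_percent of the
    background variants."""
    return background_recovery(strain_calls, background_calls) >= min_percent


if __name__ == "__main__":
    background = [("chrIV", pos, "A") for pos in range(100)]
    good_strain = background[:98] + [("chrXIII", 5, "T")]  # 98% recovered
    poor_strain = background[:96]                           # 96% recovered
    print(has_adequate_coverage(good_strain, background))
    print(has_adequate_coverage(poor_strain, background))
```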

SOLiD sequencing (2 × 50). Library preparation. We performed a pilot experiment 
on seven strains using SOLiD paired-end sequencing (Supplementary Table 1). Clones 
selected for SOLiD sequencing were cultured overnight from −80 °C stocks in 4 ml 
Synthetic Complete + 2% raffinose medium. Genomic DNA was isolated using 
QIAGEN Genomic-Tip 100 according to the manufacturer’s instructions. SOLiD 
library preparation and sequencing was performed by the Molecular Biology Core 
Facility at the Dana-Farber Cancer Institute according to the manufacturer’s instruc- 
tions (Applied Biosystems, Life Technologies). 

Mapping. The sequencing reads were mapped to the S. cerevisiae reference genome 
(see ‘Mapping’ in the Illumina section above) using several mapping 
programs, including BWA (version 0.5.9), NovoAlignCS (version 1.01.05)*, Bfast 
(version 0.6.5a)™, and BowtieCS (version 0.12.7). BowtieCS and BWA were used 
in the downstream variant calling and copy number changes, while NovoAlignCS 
and Bfast served as added support in manual inspection of variants. 

Alignment tailoring. After mapping, the reads were post-processed for local realign- 
ment using SRMA (version 0.1.15)** and Samtools BAQ (version 0.1.18)°°. 

Variant calling and refinement. SNPs, small insertions and deletions (indels) were 
called from the post-processed reads using Samtools Mpileup (version 0.1.18)°°, 
VARiD (version 1.0.7f)*”, and Freebayes (version 0.8.9, http://bioinformatics.bc. 
edu/marthlab/FreeBayes). Samtools and VARiD variant calls were used to identify 
the strain background (parental variants relative to the reference). These variants 
were filtered on the basis of reads supporting the allele in both directions, quality 
score of the call, and adequate read coverage over the call. Once filtered, all of the 
variant calls for the evolved strains were merged and compared with the parent. 
Variations were verified by manual inspection followed by Sanger validation for 
both a set of randomly sampled loci and regions of disagreement between different 
combinations of the mapping software and the variant callers (that is, dinucleotide 
SNPs and multiple indels within a single read). The resulting set was later used for 
identification of strain-unique variants in the evolved strains. 

To identify strain-unique variants, Freebayes, a variant caller capable of handling higher 
ploidy (ploidy > 2N), was used. Freebayes has the ability to set the assumed ploidy 
over a genomic region to adjust the expected distribution for allelic frequency. The 
assumed ploidy was determined using aCGH as well as the copy number changes. 
The Freebayes called variants on each evolved progeny were then cross-referenced 
with the parental variants to produce strain-unique variants (Supplementary Table 1). 
These variants were then manually examined in Integrative Genomics Viewer” 
and validated by Sanger sequencing. PCR amplification and Sanger sequencing of 
approximately 200 bp on either side of the sequence variants used DNA from the 
evolved clone and the ancestor. 

Chromosomal CNVs. Copy number changes, first identified in the aCGH data, were 
confirmed in the genomic DNA sequencing using BedTools genomeCoverageBed*' 
in combination with custom in-house scripts and DESeq (version 1.10.1)". Briefly, 
the normalized genomic copy number of all annotated genes in each strain was com- 
pared back with the parent. These comparisons then were plotted using an in-house 
script (see Chromosomal CNV identification above). 
Generation of isogenic YFP ploidy series 
Extended Data Figure 1 | Schematic representation of the construction of isogenic haploid, diploid, and tetraploid strains used in this study. Relevant strain 
numbers are indicated for the CFP- and YFP-containing ancestors. 


©2015 Macmillan Publishers Limited. All rights reserved 



[Extended Data Figure 2 image. a, Table of best-fit values and 95% confidence intervals under the exponential, uniform, and delta distributions (new model) and under the equivalence principle model (delta distribution). b-e, Two-dimensional plots against fitness effect for the uniform distribution, delta distribution, exponential distribution, and equivalence principle model. f, Probability density function against fitness effect for the exponential, uniform, almost-delta, and delta distributions.] 



Extended Data Figure 2 | Estimates from our mathematical modelling of 
the best-fit value of the beneficial mutation rate (μ) and the selection 
coefficient (s) of each ploidy evolution experiment. a, Table of μ and s values 
that had the best-fit between the simulations and the experimental data; 
brackets, 95% confidence intervals. Values were determined on the basis of 
different assumptions about the underlying distribution of beneficial 
mutations, which included (b) uniform, (c) delta, and (d) exponential 
distributions. Estimates of μ and s were also obtained with (e) the equivalence 
principle model’’, which assumes a delta distribution of beneficial mutations. 
Each two-dimensional plot includes the error range obtained by parametric 
bootstrap of 1,000 independent simulated data sets (Methods). The 95% 
confidence intervals of μ and s from those 1,000 data sets were defined as the 
error ranges. f, Schematic diagram of the three distributions of fitness effects 
that we used in our mathematical modelling: exponential (red), uniform 
(black), and delta (green) distributions. For illustration, we also provide a 
narrow Gaussian distribution (blue) that is close to a delta function. The real 
distribution of fitness effects probably has a more complex structure than any of 
the examples shown. The diagram illustrates the fact that the shape of the 
assumed distribution mandates differences in mutation rates. For example, if 
the mutations that mainly drive adaptation fall within the region of the double 
arrow, only a small proportion of the mutations from the exponential 
distribution will fall within this range, necessitating a much higher mutation 
rate to generate mutations in this region. By contrast, the delta distribution lies 
in the middle of the double arrow range; therefore, all of the mutations that 
arise from this distribution are strong enough to contribute to adaptation, 
resulting in a relatively lower mutation rate. The uniform distribution is 
intermediate between these two extremes. Only a small portion of the 
mutations of the uniform distribution is within the double arrow region, but the 
probability of these mutations is orders of magnitude larger than the 
exponential. Therefore, the mutation rate of the uniform is closer to the delta 
than to the exponential distribution. The values used to generate this figure 
are the best-fit values of μ and s of the haploid populations under the three different 
distributions. See Methods for more details. 
Extended Data Figure 3 | Experimental and computational analyses of the 
noise in our experimental measurements and of the methods used in our 
mathematical modelling (see Supplementary Information). a, Three 
different methods were used to determine the percentage of YFP-expressing 
cells in mixtures of the 1N, 2N, and 4N CFP and YFP ancestor strains. Cells 
were analysed by flow cytometry (10,000 cells) and fluorescence microscopy 
(300 cells), and by single colony analysis (96 colonies) of the mixture before 
galactose induction. The percentage YFP determined by all three methods was 
highly correlated (Pearson correlation coefficient = 0.98). b, Table showing 
variation in flow cytometry replicate measurements: the standard deviation of 
the percentage YFP obtained from 48 replicate populations of six different 
CFP:YFP ratios, for each ploidy type. c, Table showing the average and standard 
deviation of the best-fit values for different ratios between beneficial (U_b) and 
deleterious (U_d) mutations, obtained from 100 independent data sets. 
d, Evaluation of different summary statistics by calculating the distribution of 
best-fit values from 1,000 replicate simulations. Four different summary 
statistics were used to analyse 1,000 replicates of a parameter pair, s and μ 
(see Methods). The summary statistic using ten bins has the highest mode 
and no outliers, and was used to generate our best-fit values. e, Criteria for 
exclusion of near-neutral mutations for implementation of the branching 
evolutionary model with an exponential distribution of mutations. Shown is the 
average deviation from equal percentages of YFP and CFP-expressing cells 
with different thresholds for neutral mutations. The threshold (Tr) represents 
the fraction of the average fitness effect (s), meaning every mutation whose 
absolute value was smaller than Tr X s was excluded. For this scenario (with 
parameters μ = 2 X 10 ° and s = 0.08), we can exclude every mutation with a 
fitness effect smaller than s (that is, Tr = 1, light blue) without changing the 
outcome relative to excluding no mutations (Tr = 0). However, when 
excluding all mutations with fitness effects smaller than 10 s (Tr = 10, dark 
green), the result changes substantially. Thus, for high mutation rates (Nμ > 1), 
we can exclude weak mutations”. 
Extended Data Figure 4 | aCGH karyotype of the ancestor strains used in 
this study. Aneuploidy was not detected in the parental 1N, 2N, or 4N strains. 
Genomic DNA from each strain was compared with that of an isogenic 
ancestor PY3295 (BY4741 MATa ura3 his3 trp1 leu2 LYS2) and log₂ DNA copy 
number ratios were plotted using a custom Matlab script. To account for 
regions of complete deletion, the data were cropped at log₂ ratios of ±2.0 
and averaged across each chromosome using a sliding window of nine 
oligonucleotides. A log₂ ratio of zero is indicated by the red line. Loci altered 
during strain construction are indicated (TRP1, pSTE5, URA3, STE4). Strain 
ploidy, determined by flow cytometry, is indicated on the right. 
Extended Data Figure 5 | aCGH karyotype of haploid- and diploid-evolved 
clones at generation 250. a, aCGH of eight haploid-evolved clones. Data are 
displayed as in Extended Data Fig. 4. No aneuploidy was detected. Clone 
1N_131 acquired the HXT6/7 amplification (arrow). b, aCGH of eight 


diploid-evolved clones. No aneuploidy was detected, but all clones except 
2N_233 acquired the HXT6/7 amplification. Log, ratios were averaged across 
each chromosome using a sliding window of 29 oligonucleotides. The ploidy of 
the evolved clone, determined by flow cytometry, is indicated on the right. 
Extended Data Figure 6 | aCGH karyotype for 20 tetraploid-evolved clones
at generation 250. aCGH data are displayed as in Extended Data Fig. 4. Note
that whole chromosome or large segmental chromosome gain and loss events
are observed in all clones except clone 4N_337. Ploidy of the evolved clone,
determined by flow cytometry, is indicated on the right, with +/− indicating
chromosome aneuploidy. Some highly aneuploid clones had widely different
chromosome copy numbers for different chromosomes (for example, some
chromosomes were disomic, others trisomic and tetrasomic).
Extended Data Figure 7 | Analysis of recurrent and concerted chromosome
loss events in the tetraploid-evolved clones. a, Evolved tetraploids acquired 
large segmental aneuploidies (regions greater than the ~7 kb HXT6/7 
amplification); aCGH data for individual chromosomes with large segmental 
aneuploidies in 4N-evolved clones (plotted using Treeview). All breakpoints
occurred at or near Ty sequences (arrowheads). b, The pairwise patterns 
(Pearson correlation) of all chromosome copy number alterations in the 
4N-evolved clones at generation 250 (n = 30, Supplementary Table 2). The 
copy numbers of some chromosomes were correlated (for example, 
chromosome XV and chromosome XVI), whereas others were anti-correlated
(for example, chromosome VIII and chromosome IX), possibly reflecting the 
need for gene expression balance. c, Hierarchical clustering showing the 
copy number relationship among the chromosomes. d, Proportion of all 
chromosomes in the evolved tetraploid clones with the indicated copy number 
(black). The copy number of chromosome XIII (grey) in the 4N-evolved clones 
at generation 250 was significantly different from that of all other aneuploid 
chromosomes (Cochran-Armitage test, P< 1X 10”). 
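
The pairwise comparison in panel b rests on the Pearson correlation of copy numbers across clones. A minimal sketch of that calculation is given below. The copy-number vectors are invented for illustration and are not taken from Supplementary Table 2, and the labels `chr_p`, `chr_q` and `chr_r` are hypothetical.

```python
from math import sqrt

def pearson(x, y):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)

# Invented copy numbers of three chromosomes in six hypothetical clones.
chr_p = [4, 3, 4, 3, 5, 4]
chr_q = [3, 2, 3, 2, 4, 3]   # gained and lost together with chr_p
chr_r = [4, 5, 4, 5, 3, 4]   # gained when chr_p is lost, and vice versa

print(pearson(chr_p, chr_q))
print(pearson(chr_p, chr_r))
```

A coefficient near +1 marks chromosomes whose copy numbers change together, and a coefficient near −1 marks chromosomes that change in opposite directions. A matrix of such coefficients is the usual input for the hierarchical clustering shown in panel c.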
Extended Data Figure 8 | aCGH karyotype for tetraploid-evolved clones
at generations 35, 55, and 500. All 4N-evolved clones at (a) generations 35
and 55 and (b) generation 500 were aneuploid for multiple chromosomes or
carried large segmental chromosome aneuploidies, except for clone
4N_gen35_503, which remained tetraploid. Data are displayed as in Extended
Data Fig. 4. Ploidy of the evolved clone, determined by flow cytometry, is
indicated on the right, with +/− indicating chromosome aneuploidy.




[Figure: Aneuploid Strain Construction. aCGH profiles for strains PY7296, PY7303, PY7300 and PY7301, with ploidy labels (2N or 4N plus an extra chromosome) on the right.]


Extended Data Figure 9 | aCGH from isogenic 2N and 4N strains with an extra copy of chromosome XIII or chromosome XII. Data are displayed as in 
Extended Data Fig. 5b. 
Extended Data Table 1 | Yeast strains and plasmids used in this study

Strain (Ploidy) or Plasmid | Parental strain | Relevant genotype | Source
BY3295 (1N) | BY4741 | MATa his3Δ leu2Δ met15Δ ura3Δ | Pellman collection
PY5997 (1N) | BY3295 | matΔ::pSTE5-ura3::TRP1 ste4Δ::HygroR trp1::NatR | This study
PY5998 (1N) | PY5997 | matΔ::pSTE5-ura3::TRP1 ste4Δ::HygroR trp1::NatR::pGAL-ceCFP-tADH-SpHIS5 | This study
PY5999 (1N) | PY5997 | matΔ::pSTE5-ura3::TRP1 ste4Δ::HygroR trp1::NatR::pGAL-eYFP-tADH-SpHIS5 | This study
PY6006 (2N) | PY5999 (2x) | matΔ::pSTE5-ura3::TRP1 ste4Δ::HygroR trp1::NatR::pGAL-eYFP-tADH-SpHIS5 | This study
PY6008 (2N) | PY5998 (2x) | matΔ::pSTE5-ura3::TRP1 ste4Δ::HygroR trp1::NatR::pGAL-ceCFP-tADH-SpHIS5 | This study
PY6014 (2N) | PY5999 (2x) | matΔ::pSTE5-ura3::TRP1 ste4Δ::HygroR trp1::NatR::pGAL-eYFP-tADH-SpHIS5 | This study
PY6022 (2N) | PY5998 (2x) | matΔ::pSTE5-ura3::TRP1 ste4Δ::HygroR trp1::NatR::pGAL-ceCFP-tADH-SpHIS5 | This study
PY6031 (4N) | PY6008 (4x) | matΔ::pSTE5-ura3::TRP1 ste4Δ::HygroR trp1::NatR::pGAL-ceCFP-tADH-SpHIS5 | This study
PY6032 (4N) | PY6022 (4x) | matΔ::pSTE5-ura3::TRP1 ste4Δ::HygroR trp1::NatR::pGAL-ceCFP-tADH-SpHIS5 | This study
PY6040 (4N) | PY6006 (4x) | matΔ::pSTE5-ura3::TRP1 ste4Δ::HygroR trp1::NatR::pGAL-eYFP-tADH-SpHIS5 | This study
PY6045 (4N) | PY6014 (4x) | matΔ::pSTE5-ura3::TRP1 ste4Δ::HygroR trp1::NatR::pGAL-eYFP-tADH-SpHIS5 | This study
PY7284 (1N) | PY5999 | SNF3-G439E | This study
PY7289-PY7290 (2N) | PY5999 | SNF3-G439E/SNF3 | This study
PY7285-PY7288 (2N) | PY5999 | SNF3-G439E/SNF3-G439E | This study
PY7291-PY7294 (4N) | PY5999 | SNF3-G439E/SNF3/SNF3/SNF3 | This study
PY7295 (2N) | S288c | RLY4737 MATa/a ura3Δ his3Δ trp1Δ leu2Δ | Ref. 50
PY7296 (2N) | PY7295 | RLY4888 MATa/a + chr. XIII trisomy | Ref. 50
PY7297-PY7299 (4N) | PY7295 | MATa/a/a/a | This study
PY7300-PY7302 (4N) | PY7295 | MATa/a/a/a + chr. XIII pentasomy | This study
PY7303-PY7305 (4N) | PY7295 | MATa/a/a/a + chr. XII pentasomy | This study
PB1500 | | YFP-tADH-SpHIS5, AmpR | Yeast Resource Center
PB1499 | | CFP-tADH KanR AmpR | Yeast Resource Center
PB2452 | | CFP-tADH SpHIS5, AmpR | Pellman collection
PB2694 | | pGAL1-ceCFP-tADH-SpHIS5, AmpR | This study
PB2697 | | pGAL1-eYFP-tADH-SpHIS5, AmpR | This study
PB2314 | | MATa::pSTE5-URA3, AmpR | Ref. 12
PB1308 | | ura3::TRP1 AmpR | Ref. 46
PB2577 | | MATa URA3 CEN AmpR | This study
B1819 | | LEU2 CEN AmpR | Pellman collection
PB2647 | | STE4 LEU2 CEN AmpR | This study
PB2649 | | STE4 URA3 CEN MATa, AmpR | This study
PB1640 | | hphMX4 AmpR | Ref. 47
PB1942 | | pGAL-HO HIS3 AmpR | Gift of the Fink lab
PB1650 | | pGAL-HO URA3 LEU2 | Gift of the Elion lab
pCORE | | kanMX4 KlURA3 | Ref. 48
Thermosensory processing in the Drosophila brain


Wendy W. Liu, Ofer Mazor & Rachel I. Wilson


In Drosophila, just as in vertebrates, changes in external temperature
are encoded by bidirectional opponent thermoreceptor cells:
some cells are excited by warming and inhibited by cooling, whereas
others are excited by cooling and inhibited by warming. The central
circuits that process these signals are not understood. In Drosophila,
a specific brain region receives input from thermoreceptor cells.
Here we show that distinct genetically identified projection neurons
(PNs) in this brain region are excited by cooling, warming, or both.
The PNs excited by cooling receive mainly feed-forward excitation
from cool thermoreceptors. In contrast, the PNs excited by warming
(‘warm-PNs’) receive both excitation from warm thermoreceptors
and crossover inhibition from cool thermoreceptors through inhibitory
interneurons. Notably, this crossover inhibition elicits warming-evoked
excitation, because warming suppresses tonic activity in cool
thermoreceptors. This in turn disinhibits warm-PNs and sums with
feed-forward excitation evoked by warming. Crossover inhibition
could cancel non-thermal activity (noise) that is positively correlated
among warm and cool thermoreceptor cells, while reinforcing
thermal activity which is anti-correlated. Our results show how central
circuits can combine signals from bidirectional opponent neurons
to construct sensitive and robust neural codes.

The simplest model of thermosensory processing is that warm thermoreceptor
cells contribute only to the representation of warming, whereas
cool cells contribute only to the representation of cooling (the
labelled-line hypothesis). In support of this idea, ablating heat-activated
thermoreceptor cells in the mouse abolishes behavioural responses to noxious
warming but not cooling. Similarly, in Drosophila, silencing warm-activated
thermoreceptor cells attenuates behavioural responses to warming
but not cooling; conversely, silencing cool-activated thermoreceptor
cells attenuates behavioural responses to cooling but not warming. An
alternative model is that different thermoreceptor cells interact centrally,
so that a single cell contributes to the representation of both cooling and
warming. This idea could explain perceptual phenomena like the thermal
grill illusion, where interlaced warm and cool objects produce the
perception of burning heat.

We investigated these possibilities in the Drosophila thermosensory
system. Peripheral thermoreceptor cells in Drosophila reside mainly in
the antenna, and are housed inside two structures termed the arista
and the sacculus. The arista contains three cells that are excited by
warming and inhibited by cooling (‘warm cells’), as well as three cells
that are excited by cooling and inhibited by warming (‘cool cells’). The
sacculus contains additional cool cells. All these cells project to a region
of the brain called the proximal antennal protocerebrum. Recent studies
have identified several genes involved in thermal transduction in the
antenna. Among these, the warm receptor Gr28b.d has a particularly
specific expression pattern, permitting genetic access to the warm
cells in the arista.

In a visual screen of ~7,000 Gal4 lines, we identified several that
label central neurons with dendrites in the proximal antennal proto- 
cerebrum and axons projecting to higher brain regions. We call such cells 
thermosensory projection neurons (PNs). We used these Gal4 lines to 
drive GFP expression in small groups of PNs and performed in vivo 
whole-cell current-clamp recordings from these neurons. We included 
Figure 1 | Projection neurons excited by cooling. a–d, Fast-cool-PNs.
a, In vivo recording from a fast-cool-PN. Stimuli are steps from ~24 °C to
~13 °C or ~34 °C. Vm, membrane potential. b, Mean firing rate for the
fast-cool-PNs ± s.e.m. across experiments (n = 7–10 PNs per stimulus per
condition), with either arista intact or arista removed. c, Sensitivity to small
temperature steps: mean change in firing rate versus change in temperature,
mean ± s.e.m. (n = 5). d, Proposed circuit for fast-cool-PNs (see
Supplementary Discussion regarding circuit diagrams). e–h, Slow-cool-PNs.
e, Slow-cool-PNs show less adaptation, and they respond with similar peak
firing rates to fast and slow cooling. Prolonged depolarization can elicit
complex spikes (inset, expanded 12-fold). f, For the slow-cool-PNs, removing
the arista has comparatively little effect (n = 7–14). g, Slow-cool-PNs are less
sensitive to small cooling steps (n = 7). In h, the thinner arrow represents a
weaker connection.
biocytin in the patch pipette to reconstruct the morphology of individual
neurons.

Because peripheral thermoreceptor cells reside in the antenna, we 
generated a thermal stimulus by manipulating the temperature of a stream
of solution flowing over the antenna while keeping the temperature of 
the brain constant (Extended Data Fig. 1). As a control, we confirmed that 
cutting the axons of peripheral thermoreceptor cells abolished responses 
in this brain region (see Methods). We used large temperature changes 
to robustly activate peripheral thermoreceptor cells (~10 °C). These 
large temperature changes were delivered at two speeds: fast (~100 ms)
and slow (~7 s). We also used small steps (<1 °C over ~100 ms) to
measure sensitivity near threshold. This panel of stimuli revealed four 
physiological types of PNs, each with a distinctive morphology (Extended 
Data Fig. 2). 

Two PN types were excited by cooling and inhibited by warming. 
One type showed strong adaptation to sustained temperature decreases 
(Fig. 1a, b and Extended Data Fig. 3). These PNs also responded better 
to fast cooling than to slow cooling. We term these ‘fast-cool-PNs’. 
Fast-cool-PNs were highly sensitive to small cooling steps (Fig. 1c). 
Fast-cool-PNs project to the lateral protocerebrum, a higher brain region
that receives multimodal inputs.

By contrast, the second type of ‘cool-PN’ showed little adaptation to
large sustained temperature decreases (Fig. 1e, f and Extended Data
Fig. 3). Moreover, slow and fast cooling elicited similar peak firing rates
(Fig. 1e, f). We term these ‘slow-cool-PNs’. Slow-cool-PNs project to
the mushroom body, a higher brain region involved in learning and
memory. Slow-cool-PNs were less sensitive than fast-cool-PNs to small
cooling steps (Fig. 1c, g). Thus, these two cool-PN types encode different
features of cool stimuli (Supplementary Discussion).

These cool-PN types have dendrites that overlap with the axons of 
the cool thermoreceptor cells (Extended Data Fig. 4). Clipping the arista 
largely eliminated the responses of fast-cool-PNs (Fig. 1b). By contrast, 
the responses of slow-cool-PNs remained mostly intact, although the 
initial response to cooling was diminished, and tonic firing at room 
temperature was reduced (Fig. 1f). All responses in slow-cool-PNs were 
abolished by cutting the antennal nerve (data not shown). Together, these 
results suggest a simple wiring scheme: fast-cool-PNs receive input from 
the cool cells in the arista, whereas slow-cool-PNs receive input mainly 
from the cool cells in the sacculus, with a weaker input from cool cells in 
the arista. 

The idea of a simple wiring scheme was reinforced by additional experiments
on the cool-PNs. Namely, we observed only slight disinhibition
in the cooling responses of cool-PNs when we bath-applied antagonists 
of inhibitory neurotransmitter receptors (Extended Data Fig. 5). Warm- 
ing still suppressed the tonic activity of the cool-PNs when synaptic 
inhibition was blocked, implying that warming can inhibit these neu- 
rons by suppressing tonic excitation from cool thermoreceptor cells. 
Cool-PN responses were essentially unaltered by a functional null mutation
in Gr28b.d, the warm receptor in the arista (Extended Data Fig. 5).
Together, these results suggest that the thermal responses of the cool- 
PNs are driven primarily by cool thermoreceptor cells, with little role 
for warm thermoreceptor cells (Fig. 1d, h). 

Unlike the representation of cooling, we found that the representa- 
tion of warming is mediated by a relatively complex circuit. We iden- 
tified one PN type that is excited by warming and inhibited by cooling 
(Fig. 2a, b). These warm-PNs project to the lateral protocerebrum (Ex- 
tended Data Fig. 2). Warm-PNs responded better to fast warming than 
to slow warming, and they showed adaptation to prolonged temperature 
increases (Fig. 2b and Extended Data Fig. 3). Their responses were 
abolished by removing the arista (data not shown). 

Blocking synaptic inhibition depolarized warm-PNs (by 6 ± 1 mV,
mean ± s.e.m.), suggesting these cells are subject to tonic inhibition.


Figure 2 | Projection neurons excited specifically by warming. a, A warm-PN.
Cooling evokes a barrage of excitatory postsynaptic potentials (inset,
expanded fourfold). Vm, membrane potential. b, Mean firing rate for
warm-PNs (n = 6–8). c, Mean firing rate with inhibition blocked using picrotoxin (an
antagonist of GABA-A and GluCl receptors) and CGP54626 (a GABA-B
antagonist) (n = 3 or 4). Spiking at cooling onset suggests excitation from the
cool pathway. Spiking at cooling offset likely arises from the warm pathway and
is normally masked by slowly-decaying cooling-evoked inhibition. d, Mean
firing rate in the Gr28b.d mutant (n = 5–7). The response to slow warming was
smaller than that in wild type, although the difference was not statistically
significant (P = 0.20). Blocking inhibition eliminates the excitatory response to
fast warming steps (n = 4). e, Mean firing rate in genetic background for
histamine experiments, before injecting histamine (n = 7). A histamine-gated
chloride channel is expressed in warm thermoreceptor cells. f, Injecting
histamine to silence warm thermoreceptor cells significantly reduces responses
to slow warming (n = 7, P < 0.05, paired t-test). Blocking inhibition eliminates
warming-evoked excitation and unmasks cooling-evoked excitation (n = 6).
See Extended Data Fig. 7 for controls. g, Sensitivity to small temperature steps
(wild type n = 9; wild type with inhibition blocked n = 6; Gr28b.d mutant
n = 5; Gr28b.d mutant with inhibition blocked n = 3). In wild type, blocking
inhibition had no significant effect on sensitivity to warming (unpaired t-tests,
with iterative Bonferroni corrections; see Methods). However, responses were
significantly smaller in the mutant near the threshold of warming-evoked
excitation (+0.425 °C, P = 0.017). h, Proposed circuit for the warm-PNs.
Notably, blocking synaptic inhibition also unmasked weak cooling-evoked
excitation in warm-PNs (Fig. 2c). This implies that these PNs receive
input from both warm and cool thermoreceptor cells in the arista, and 
weak cooling-evoked excitation is normally masked by strong cooling- 
evoked inhibition. Moreover, although cooling hyperpolarizes these 
cells, it also elicits a barrage of excitatory postsynaptic potentials riding 
on the hyperpolarized membrane potential (Fig. 2a, inset). This obser- 
vation is further evidence that these cells receive excitatory input from 
cool thermoreceptor cells. Finally, we note that the dendrites of warm- 
PNs overlap with the axons of both warm and cool thermoreceptor 
cells (Extended Data Fig. 4), compatible with the idea that they receive 
input from both. 

Notably, mutating the warm receptor gene Gr28b.d did not abolish 
excitatory responses to warming in these PNs (Fig. 2d). Indeed, responses 
to fast warming were almost unchanged. Interestingly, however, responses 
to slow warming were smaller than in wild-type PNs. These results sug- 
gest that there are two pathways for generating warming-evoked excita- 
tion in these PNs. Gr28b.d is necessary for generating normal responses 
to slow warming, but fast warming recruits a second pathway inde- 
pendent of Gr28b.d (Supplementary Discussion). 

Based on these results, we hypothesized that the cool pathway can 
excite these neurons in response to warming through disinhibition. This 
hypothesis is motivated by the observation that cool thermoreceptor 
cells are tonically active and their tonic activity is suppressed by warming.
An interposed inhibitory neuron could relay this tonic activity in
the form of tonic inhibition, and warming would withdraw this inhibi- 
tion. This hypothesis predicts that blocking synaptic inhibition will elim- 
inate the second pathway. Indeed, we found that blocking inhibition 
abolished warming-evoked excitation in warm-PNs in the Gr28b.d 
mutant (Fig. 2d). This result was not observed in a wild-type background; 
in that case, blocking inhibition potentiated warming responses rather 
than eliminating them (Fig. 2c). As an additional control, we confirmed 
that blocking inhibition had essentially no effect on the cool-PNs in the 
Gr28b.d mutant (Extended Data Fig. 5). 
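
The logic of the disinhibition hypothesis can be made concrete with a toy steady-state rate model. The sketch below is not a fit to the recordings: every weight, gain and bias value is an arbitrary assumption chosen for illustration, and the model ignores adaptation and the difference between fast and slow stimuli.

```python
def relu(x):
    """Firing rates cannot be negative."""
    return max(0.0, x)

# Arbitrary illustrative parameters (assumptions, not measured values).
C0 = 10.0      # tonic rate of cool thermoreceptor cells at baseline
GAIN = 1.0     # receptor rate change per degree of temperature change
BIAS = 5.0     # hypothetical tonic excitatory drive onto the warm-PN
W_WARM = 1.0   # weight of warm-receptor excitation
W_COOL = 0.5   # weight of weak direct cool-receptor excitation
W_INH = 1.0    # weight of inhibition relayed by the cool-driven interneuron

def warm_pn_rate(delta_t, warm_intact=True, inhibition_intact=True):
    """Steady-state warm-PN rate for a temperature change delta_t (degrees C)."""
    cool = relu(C0 - GAIN * delta_t)             # warming suppresses cool cells
    warm = relu(GAIN * delta_t) if warm_intact else 0.0
    cool_ln = cool                               # interneuron relays cool activity
    w_inh = W_INH if inhibition_intact else 0.0
    return relu(BIAS + W_WARM * warm + W_COOL * cool - w_inh * cool_ln)

def response(delta_t, **conditions):
    """Change in warm-PN rate relative to baseline under the same conditions."""
    return warm_pn_rate(delta_t, **conditions) - warm_pn_rate(0.0, **conditions)

for warm_ok in (True, False):
    for inh_ok in (True, False):
        print(warm_ok, inh_ok,
              response(+10.0, warm_intact=warm_ok, inhibition_intact=inh_ok))
```

With these assumed parameters, silencing the warm input leaves a positive warming response as long as inhibition is intact. Removing inhibition as well abolishes it, which is the pattern the hypothesis predicts. Blocking inhibition alone also reveals a positive response to cooling in this toy model.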

In principle, a mutation in the warm receptor Gr28b.d might cause 
the circuit to re-wire, or there might be other receptor molecules that 
could mediate responses in warm thermoreceptor cells when Gr28b.d is 
absent. To address these caveats, we acutely silenced the warm thermo- 
receptor cells themselves. We used the LexA system to selectively express 
a histamine-gated chloride channel (Ort) in warm thermoreceptor cells 
under the control of the Gr28b.d promoter (Extended Data Fig. 6). Neu- 
rons that mis-express Ort in this manner can be acutely silenced by 
histamine. We first confirmed that simply expressing Ort in warm
thermoreceptor cells did not substantially alter the responses of warm- 
PNs (Fig. 2e). During these recordings, we then microiontophoresed 
histamine into the antenna to silence the warm thermoreceptor cells. 
PN responses to fast warming were largely intact, although responses 
to slow warming were significantly reduced (Fig. 2f). Blocking inhi- 
bition nearly abolished warming-evoked excitation when the warm 
thermoreceptor cells were silenced (Fig. 2f). Thus, this manipulation 
phenocopies the Gr28b.d mutant. These results further support a model 
whereby the cool pathway can cause warming-induced excitation via 
disinhibition. 

Small temperature steps (<1 °C) yielded further insight into the two 
pathways which provide input to warm-PNs. Under normal conditions, 
warm-PNs were excited by small warming steps (Fig. 2g). Blocking inhi- 
bition unmasked a response to small cooling steps (Fig. 2g). Responses 
to small warming steps were still present in the Gr28b.d mutant, but— 
unlike responses to large warming steps—were significantly reduced. 
Responses to small warming steps were eliminated in the mutant after 
inhibition was blocked (Fig. 2g). These results bolster the conclusion 
that both the warm pathway and the cool pathway can drive excitatory 
responses in warm-PNs (Fig. 2h). The warm pathway is most important 
when the rate of warming is low (~1 °C per s, as in our ‘small’ stimuli 
and ‘slow large’ stimuli). The cool pathway becomes dominant when 
the rate of warming is high (~10 °C over 100 ms). 
The fourth and final projection neuron type we identified was excited
by both cooling and warming (Fig. 3a, b). These ‘warm-cool-PNs’ pro- 
ject to the lateral protocerebrum (Extended Data Fig. 2). They responded 
to fast but not slow changes in temperature, and their responses to 
sustained stimuli were strongly adapting (Fig. 3b and Extended Data 
Fig. 3). Thus, these neurons signal rapid temperature shifts, regardless of 
the direction of those shifts. Removing the arista eliminated all responses 
in these neurons (data not shown). Blocking inhibition depolarized 
these cells (by 8 ± 4 mV, mean ± s.e.m.) with modest effects on their
thermal responses (Fig. 3c). 
Figure 3 | Projection neurons excited by both warming and cooling.
a, A warm-cool-PN. Fast warming and cooling elicit transient bursts of spikes
(insets, expanded sevenfold). Stimulus offset does not elicit a response,
probably because temperature is changing more slowly. Vm, membrane
potential. b, Mean firing rate (n = 10–13). c, Mean firing rate with inhibition
blocked (n = 3–5). d, Mean firing rate in the Gr28b.d mutant, before and
after blocking inhibition (n = 7–11 and 4). e, Mean firing rate in genetic
background for histamine experiments, before injecting histamine (n = 4).
A histamine-gated chloride channel is expressed in warm thermoreceptor cells.
f, g, Responses after injecting histamine to silence warm thermoreceptor cells
(n = 4) and after blocking inhibition (n = 3). See Extended Data Fig. 7 for
controls. h, Proposed circuit for the warm-cool-PNs.
Figure 4 | Inhibitory local neurons responding
to thermal stimuli. a, Recording from a
GABAergic LN that is excited by cooling and
inhibited by warming (a ‘cool-LN’). Vm, membrane
potential. b, Trial-averaged firing rates.
be related to both membrane potential and firing
rate. d–f, Same as above, but for a different
GABAergic LN. g, Sensitivity to small temperature
steps in thermoreceptor cells of the arista,
experiment. See also Extended Data Fig. 6.
Notably, warming-evoked excitation persisted in warm-cool-PNs
in the Gr28b.d mutant (Fig. 3d and Extended Data Fig. 8), just as we 
observed in warm-PNs. As before, blocking synaptic inhibition abolished 
warming-evoked excitation in the Gr28b.d mutant (Fig. 3d). Silencing 
thermoreceptor warm cells with histamine again phenocopied the Gr28b.d 
mutation (Fig. 3e-g). These results imply that warming-evoked excita- 
tion in these cells arises from two sources: increased excitation from the 
warm pathway, and withdrawal of tonic inhibition from the cool path- 
way (Fig. 3h). The dendrites of warm-cool-PNs overlap with the axons 
of both warm and cool receptor cells (Extended Data Fig. 4), compatible 
with the conclusion that they receive input from both. 

These findings predict that there are inhibitory neurons in this brain 
region that are tonically active, inhibited by warming, and excited by 
cooling. To test these predictions, we made recordings from inhibitory 
local neurons in this brain region. The main inhibitory neurotransmitters 
in the Drosophila brain are GABA and glutamate, and so we labelled
either GABAergic or glutamatergic neurons with GFP to target our 
electrodes to these cells (Extended Data Fig. 9). In total, we recorded 
from 17 inhibitory neurons that arbourized locally in this brain region 
and responded to thermal stimuli (Extended Data Fig. 2). 

Most of the inhibitory local neurons (LNs) we recorded from (14 
of 17) were tonically active at room temperature, inhibited by warming,
and excited by cooling (Fig. 4a–f). We term these ‘cool-LNs’. (The
three remaining LNs had the opposite tuning profile; Extended Data 
Fig. 10.) Cool-LNs were found within both the GABAergic and gluta- 
matergic populations. 

Several features of PN responses are explained by a model where cool- 
LNs inhibit the warm-PNs and warm-cool-PNs. First, cooling excites 
these LNs, which could mask cooling-evoked excitation onto warm-PNs 
(and also attenuate cooling-evoked excitation in warm-cool-PNs), except 
when synaptic inhibition is blocked. Conversely, warming inhibits cool- 
LNs, which should disinhibit PNs, allowing them to become excited 
even when the warm receptor is mutated or warm thermoreceptor cells 
are silenced, but in a manner that depends on intact synaptic inhibition. 

In summary, our results indicate an asymmetry in the central processing
of warm versus cool stimuli. Cool-PNs are driven primarily by excitation
from cool thermoreceptor cells. These simple circuits evoke the
classic concept of the labelled line. By contrast, warm-PNs and
warm-cool-PNs receive excitation from both the warm and cool pathway, as
well as potent crossover inhibition from the cool pathway. This more
complex arrangement is reminiscent of some circuits in the mammalian
spinal cord (Supplementary Discussion).

from the membrane potential of fast-cool-PNs,
the more sensitive of the two cool-PN types. Each
line represents a different experiment. The
population of cool thermoreceptors must be at least
this sensitive to warming.

In warm-PNs (and warm-cool-PNs), crossover inhibition from the
cool pathway may help to cancel non-thermal activity (noise) that is
correlated among warm and cool thermoreceptor cells. Correlated activity
could arise from mechanical forces that bend the arista, thereby deforming
and potentially depolarizing both warm and cool cells. Unlike the
thermal responses of these cells, which are anti-correlated, correlated
activity in warm and cool thermoreceptor cells would be cancelled by
crossover inhibition. Crossover inhibition might also increase sensitivity
to small warming steps, because cool thermoreceptor cells are sensitive
to warming, even though they are inhibited rather than excited by warming
(Fig. 4g, h). Notably, some insects lack warm receptor cells entirely
and yet possess both cool-PNs and warm-PNs. It has been suggested that
disinhibition might be the only source of warming-evoked excitation in
the warm-PNs of these insects. In mammals, many cool peripheral
thermoreceptor cells are inhibited by warming, and warm peripheral
thermoreceptor cells are inhibited by cooling. Thus, the mammalian
spinal cord might also use crossover inhibition, because each receptor
type is informative about both warming and cooling.
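
The noise-cancellation argument can be checked with a short numerical sketch. It assumes a purely linear, idealized case in which warm and cool cells carry equal and opposite thermal signals plus an identical shared noise term. The signal ranges, noise level and variable names are invented for illustration.

```python
import random

rng = random.Random(1)   # arbitrary fixed seed (assumption)

# Deviations from baseline: a thermal signal and a shared non-thermal noise term.
thermal = [rng.uniform(-1.0, 1.0) for _ in range(1000)]
noise = [rng.gauss(0.0, 0.5) for _ in range(1000)]

# Anti-correlated thermal responses, positively correlated noise.
warm = [t + n for t, n in zip(thermal, noise)]
cool = [-t + n for t, n in zip(thermal, noise)]

# Crossover inhibition: the PN is excited by warm cells and inhibited by cool cells.
pn_crossover = [w - c for w, c in zip(warm, cool)]
# For comparison, a readout that simply sums the two inputs.
pn_sum = [w + c for w, c in zip(warm, cool)]

max_error = max(abs(p - 2 * t) for p, t in zip(pn_crossover, thermal))
max_leak = max(abs(p - 2 * n) for p, n in zip(pn_sum, noise))
print(max_error, max_leak)
```

In this idealized case the difference readout returns twice the thermal signal with the shared noise removed, whereas the summed readout returns only the noise. Real thermoreceptor cells will not be matched this exactly, so the cancellation would be partial.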


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS

Fly stocks. The genotypes used in each figure are as follows:

Figure 1a–d: GMR95C02-Gal4,pJFRC2-10XUAS-IVS-mCD8::GFP
Figure 1e–h: GMR49B06-Gal4,pJFRC2-10XUAS-IVS-mCD8::GFP and GMR67D03-Gal4,pJFRC2-10XUAS-IVS-mCD8::GFP
Figure 2a–c: GMR95C02-Gal4,pJFRC2-10XUAS-IVS-mCD8::GFP
Figure 2d: Gr28bMB03888;GMR95C02-Gal4,pJFRC2-10XUAS-IVS-mCD8::GFP
Figure 2e, f: LexAOp-HA-Ort/Gr28b.d-LexA;GMR95C02-Gal4,pJFRC2-10XUAS-IVS-mCD8::GFP/+
Figure 2g: GMR95C02-Gal4,pJFRC2-10XUAS-IVS-mCD8::GFP (wild type) and Gr28bMB03888;GMR95C02-Gal4,pJFRC2-10XUAS-IVS-mCD8::GFP (Gr28b.d −/−)
Figure 3a–c: GMR54A03-Gal4,pJFRC2-10XUAS-IVS-mCD8::GFP
Figure 3d: Gr28bMB03888;GMR54A03-Gal4,pJFRC2-10XUAS-IVS-mCD8::GFP
Figure 3e–g: LexAOp-HA-Ort/Gr28b.d-LexA;GMR54A03-Gal4,pJFRC2-10XUAS-IVS-mCD8::GFP/+
Figure 4a–f: pJFRC7- /+;Gad1-Gal4/+
Figure 4g: 20XUAS-GCaMP3/+;Gr28b.d-Gal4/+ (warm cells) and 20XUAS-GCaMP3/+;;GMR79C04-Gal4/+ (cool cells)
Figure 4h: GMR95C02-Gal4,pJFRC2-10XUAS-IVS-mCD8::GFP

The Gal4 lines used to drive transgene expression in specific cell types are as follows:
GMR54A03-Gal4: 1 warm-cool-PN
GMR49B06-Gal4: 1 slow-cool-PN
GMR67D03-Gal4: 1 slow-cool-PN (different from that labelled by 49B06)
GMR95C02-Gal4: ~4 fast-cool-PNs and 1 warm-PN
GMR52G03-Gal4: ~3 glutamatergic LNs (see Extended Data Fig. 10)
GMR91H10-Gal4: ~3 glutamatergic LNs (see Extended Data Fig. 10)
GMR79C04-Gal4: 3 cool thermoreceptor cells in the arista
Gr28b.d-Gal4: 3 warm thermoreceptor cells in the arista
Gad1-Gal4: multiple GABAergic LNs

Flies (Drosophila melanogaster) were raised on standard cornmeal agar medium 

supplemented with rehydrated potato flakes on a 12 h light/dark cycle at 25 °C. All 
experiments were performed on adult female flies between 6h and 2 days post- 
eclosion. The GMR Gal4 lines are described previously’* and were obtained from 
the Bloomington Drosophila Stock Center (BDRC). The GMR line that labels the 
warm —cool-PN (54A03) also stochastically labelled 1-2 additional nearby somata; 
we were not able to obtain recordings from these additional neurons due to weak 
and unreliable labelling. The GMR line that labels three cool thermoreceptor cells 
in the arista (79C04) was identified based on a visual screen for expression in the prox- 
imal antennal protocerebrum and was validated using GCaMP3 imaging of the 
arista (Fig. 4g and Extended Data Fig. 6); it reliably drives expression of fluorescent 
reporter transgenes in only three cells in the arista, in contrast to a previously- 
identified line, NP4486-Gal4 (ref. 2), which can drive expression in >3 cells in the 
arista on a stochastic basis (data not shown). Gr28b.d-Gal4 was first published in 
ref. 13. Gad1-Gal4 (line 2B) was constructed by Toshi Kitamoto using a Gad1 
promoter fragment and was first published in ref. 26, where it was called ‘Gad2B- 
Gal4’, Gr2gb23888 was reported in ref. 27, and also in ref. 3 (where it was termed 
the ‘M? allele); we obtained it from the BDRC. pJFRC2-10X UAS-IVS-mCD8::GFP 
(su(Hw)attP1) and 26XLexAop2-mCD8::GFP(attP2) were reported in ref. 28 and 
were obtained from the BDRC. 20X UAS-GCaMP3 (attP 18) was reported in ref. 29 
and obtained from the BDRC. pJFRC7-20XUAS-IVS-mCD8::GFP (attP40) was 
reported in ref. 30; before that report, the same transgene was described in another 
insertion site**. UAS-nls:GFP was obtained from the BDRC. LexAOp-HA-Ort was 
a generous gift from C.-H. Lee. 
Generation of transgenic flies. The Gr28b.d promoter fragment was PCR amplified 
from genomic DNA and assembled into the EcoRI/KpnI digested pBPnlsLexA:: 
GADflUw vector (Addgene) using the Gibson Assembly Master Mix (NEB). The 
PCR primers used were 5'- GGCCCTTTCGTCTTCAAGAATTCGTCACCATG 
GCCCTGCACTTCACC-3’ and 5’- GGGTGGCATTTTGATTGCTAGCGGTA 
CCTCAACGTATTTTTAAAATATTTATAGTATGTA-3’. The underlined regions 
indicate regions of overlap with the vector. Constructs were sequenced before injec- 
tion into y,w;attP40 embryos. Genetic Services, Inc. provided transgenic production 
services. The expression patterns of transgenic flies were verified by crossing each 
LexA line to a GFP reporter and examining GFP expression in the arista and the 
proximal antennal protocerebrum (Extended Data Fig. 6). 

The LexAOp-HA-Ort vector was constructed by inserting the 2xHA-ORT frag- 
ment from pUAST-2xHA-ORT (ref 31) into the pLOT vector™, and it was injected 
in y,w embryos via P-element transformation. 

Electrophysiological recordings. In vivo whole-cell patch clamp recordings from 
neural somata were performed under visual control using an upright compound 
microscope to direct the electrode to a GFP-positive soma. The fly was restrained 
in a platform which allowed the brain and antennae to be bathed in saline while 
most of the fly remained dry. Recordings were performed essentially as previously 
described”, except for the following modifications detailed in ref. 20. The head was
rotated 180° around the neck connective, so that the ventral side of the brain was
facing upwards and therefore accessible to visualization via the water-immersion 
objective above the preparation. The fly remained alive throughout the experiment 
even when the head was rotated in this manner, as evidenced by continual spon- 
taneous movements of the body, as well as normal brain activity. It was necessary 
to rotate the head because both the PNs and the LNs that innervate the proximal 
antennal protocerebrum have somata ventral to the antennal lobes, and so are 
inaccessible when the brain is dorsal-side up. 

The brain was perfused in external saline containing 103 mM NaCl, 3 mM KCl, 
5 mM N-tris(hydroxymethyl)methyl-2-aminoethane-sulfonic acid, 8 mM trehalose,
10 mM glucose, 26 mM NaHCO3, 1 mM NaH2PO4, 1.5 mM CaCl2 and 4 mM
MgCl2 (osmolarity adjusted to 270-275 mOsm). The saline was bubbled with 95%
O2/5% CO2 to a pH of 7.3. The internal solution for patch-clamp pipettes
contained the following: 140 mM potassium aspartate, 10 mM HEPES, 1 mM EGTA,
4 mM MgATP, 0.5 mM Na3GTP, 1 mM KCl, and 13 mM biocytin hydrazide. The
pH of the internal solution was adjusted to 7.2 and the osmolarity was adjusted to 
~265 mOsm. 

In initial experiments, we filled all recorded neurons with biocytin to visualize 
their morphology. Some of the lines we used in this study drive Gal4 expression in 
more than one neuron in the region where the somata of thermosensory PNs are 
located (just ventral to the antennal lobe), but we found that each morphological 
type of neuron had characteristic electrophysiological properties and temperature 
responses, and so we could accurately identify each type within the relevant Gal4 
line based on these characteristics alone. Hence in subsequent recordings, neurons 
were classified based on these criteria and not every cell was filled. However, PN 
recordings made in the Gr28b.d mutant background were always filled, and LNs 
were also always filled. 

Recordings were performed with an Axopatch 200B amplifier (Axon Instruments). 
Recorded voltages were low-pass filtered at 5 kHz and digitized at 10 kHz. The seal 
conductance can create a small tonic depolarization in small neurons** so in many 
experiments we injected a small amount of constant hyperpolarizing current to 
bring the cell back down to its native resting potential. We estimated the native 
resting potential of all these cell types to be −40 mV, based on measuring spontaneous
spiking in cell-attached mode before rupturing the seal, and then matching
the spontaneous spike rate in whole-cell mode**. Spontaneous spikes are typically
not visible in cell-attached recordings from warm-PNs, and so we assumed a resting
potential of −40 mV for these cells. In most experiments where a cell was depolarized
as a result of blocking synaptic inhibition, we returned the cell to its original
potential before measuring its responses to thermal stimuli, so as to avoid any confounding
effects of depolarization per se. All reported membrane potential values
are uncorrected for the liquid junction potential, which is −13 mV (ref. 33).

In some experiments, we used picrotoxin (100 µM) and CGP54626 (50 µM) to 
block synaptic inhibition mid-way through the experiment. These drugs were added 
to the saline perfusate. In most experiments where a cell was depolarized as a result 
of blocking synaptic inhibition, we returned the cell to its original potential before 
measuring its responses to thermal stimuli. This procedure minimized any con- 
founding effects of depolarization per se. In warm-cool-PNs, blocking synaptic 
inhibition tended to induce oscillations in the membrane potential, and so small 
effects of this manipulation should be interpreted with caution. 

LN recordings. Flies with Gad1-Gal4 driving pJFRC7-20XUAS-IVS-mCD8::GFP 
were analysed with dual confocal immunofluorescence microscopy, using anti- 
CD8 and anti-GABA antibodies, as previously described”. We found that 90% of 
CD8::GFP-positive cells in the cluster of cell bodies just ventral to the antennal lobe 
were also GABA-positive. Because we restricted our electrodes to cell bodies in that 
region, and we also filled each recorded cell with biocytin to verify that it inner- 
vated the proximal antennal protocerebrum, it is likely that almost all the LNs we 
recorded using this Gal4 line are GABAergic. In these experiments, around two- 
thirds of cells that we recorded from in the Gad1-Gal4 line did not respond to our 
thermal stimuli and were discarded; many of these are likely to represent antennal 
lobe PNs and LNs, based on the morphologies we observed when we filled a subset 
of these cells with biocytin. We used a similar procedure to target our electrodes to 
the glutamatergic LNs: flies with either GMR52G03-Gal4 or GMR91H10-Gal4 
driving pJFRC2-10XUAS-IVS-mCD8::GFP were immunostained using anti-CD8 
and anti-dVGluT antibodies, as previously described”. We found that all the 
CD8::GFP-positive cells in the cluster of cell bodies just ventral to the antennal
lobe were dVGluT-positive. As before, we restricted our electrodes to cell bodies in
that region, and we filled each recorded cell with biocytin to verify that it arbourized 
in the proximal antennal protocerebrum. 

Thermal stimulus delivery. Because all neurons are sensitive to temperature, and 
because temperature gradients between the recording and ground electrodes can 
produce electrical artefacts, we wanted to spatially restrict temperature changes to 
the peripheral thermoreceptor cells in the antenna to specifically study how peripheral 
signals are processed in the brain. Therefore, we designed a jet micro-thermode 
capable of delivering a fast temperature stimulus to a ~150 µm target (Extended
Data Fig. 1). The thermode system consists of two components: a room-temperature,
laminar flow saline perfusion chamber (Warner Instruments, RC-26GLP),
and a probe that delivers a jet of temperature-controlled saline. The fly is placed at
the centre of the chamber and the probe is positioned ~100 µm from the ipsilateral
antenna, oriented so that the jet of saline flows across both aristae and both
antennae but not the rest of the fly. The tip of the arista is immobilized with light- 
curing adhesive. The laminar flow of the perfusion chamber ensures that the saline 
jet continues on past the antenna and out of the chamber without substantial mixing 
with the rest of the bath. The bath temperature was kept at 23-25 °C. We verified 
that the baseline firing rate of all PN types adapted almost completely within a few 
minutes to temperature changes within this range. The hydrodynamics of the 
system were visualized by adding a solution of 500 µM phenol red to the saline jet. 
We adjusted the flow rates of the perfusion chamber and saline jet until no visible 
mixing occurred. Simultaneous thermocouple measurements (see Temperature 
Measurements, below) were performed to ensure that the temperature of the bath 
near the brain remained stable when temperature stimuli were presented to the 
antenna. The intrinsically thermosensitive neurons of the brain’? would therefore 
be insensitive to our stimuli. As additional controls for the spatial specificity of our 
thermal stimuli, we verified that cutting the antennal nerves bilaterally abolished 
responses in all PN types. Additionally, removing the aristae bilaterally essentially 
eliminated all temperature responses in the fast-cool-PNs (see Fig. 1b), and it 
completely eliminated the responses of the warm-PNs, warm-cool-PNs, and glu- 
tamatergic LNs (data not shown). 

The jet was produced by sending saline from a reservoir (pressurized to ~5 p.s.i.) 
through a narrow tube that terminated in a 160 µm nozzle (Small Parts, 0.0063” ID 
polyimide tubing, TWPT-0063-30). Upstream of the nozzle, the tube passed through 
a heat exchanger made from two concentric pipes. The outer pipe was fed with a 
constant flow of water pumped in from a hot bath (~40 °C) or a cold bath (~0 °C). 
Water from only one of the baths flowed into the heat exchanger at a time. A system 
of three three-way solenoid valves selected between hot mode and cold mode. 

The heat exchanger was powerful enough to bring the saline jet (as measured at 
the antenna) to ~13 °C in cold mode and ~34°C in hot mode. After switching 
modes, the jet temperature took ~7 s to plateau. During this slow transition, the 
valves could also stop flow entirely to the outer pipe (no-flow mode) when the heat 
exchanger was at room temperature (~24 °C). By switching between these three 
modes, we could achieve slow temperature transitions between three different tem- 
peratures: ~13 °C, ~24°C and ~34 °C. This method was the basis for our slow 
cool and slow warm stimuli. 

Finally, a separate ‘fast’ solenoid valve was installed in line with the pressurized 
saline tube. Opening and closing this fast valve turned the jet on or off. When the 
jet is turned off, the steady laminar flow of the bath quickly washes over the antenna, 
bringing it back to bath temperature (~24 °C). Switching this valve allowed for fast 
(~100 ms) transitions between a hot (or cold) jet and the ~24 °C bath. This method 
was the basis for our fast cool and fast warm stimuli. 

To produce small temperature steps, the heat exchanger was heated to ~34 °C, 
then cooled with ice so that the jet delivered saline that was only a few degrees warmer 
than the bath. We then gradually cooled the heat exchanger with ice while periodically 
opening and closing the fast valve to produce a fast temperature change at the 
antenna (within 100 ms) to a temperature just above 24 °C. Over several minutes, 
the temperature in the heat exchanger progressively fell to 24 °C, and then con- 
tinued to fall. In this manner, we were able to finely sample a range of temperatures 
several degrees above and below 24 °C. Because the stimulus control was performed 
manually in this case, we could not achieve precisely the same temperature stimuli 
in every experiment. We therefore recorded the temperature at the antenna in each 
stimulus trial and post hoc binned the data by temperature to construct the sensi- 
tivity plots showing average responses to small temperature steps. In the course of 
these experiments, we observed that central neurons did not respond when the jet 
temperature precisely equalled the bath temperature (to the limits of precision of 
our thermocouple), indicating that these neurons do not respond to any mecha- 
nical effects of the jet. 
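
The post hoc binning described above can be illustrated with a short Python sketch. This is not the authors' analysis code. The function names, the data layout (one list of (temperature change, response) pairs per experiment) and the 0.25 °C bin width are all assumptions made for illustration. Trials falling in the same bin are first averaged within an experiment, and the per-experiment values are then averaged across experiments, with each point plotted at the middle of its bin.

```python
import math
from collections import defaultdict
from statistics import mean, stdev


def bin_index(delta_t, bin_width):
    """Index of the temperature bin that contains delta_t (deg C)."""
    return math.floor(delta_t / bin_width)


def sensitivity_curve(experiments, bin_width=0.25):
    """Bin trials by measured temperature change.

    experiments: list of experiments, each a list of
                 (delta_t_deg_c, response) pairs (hypothetical layout).
    bin_width:   assumed bin width in deg C (not stated in the text).
    Returns a list of (bin_middle, mean_response, sem) tuples.
    """
    per_bin = defaultdict(list)  # bin index -> one value per experiment
    for trials in experiments:
        within = defaultdict(list)
        for delta_t, response in trials:
            within[bin_index(delta_t, bin_width)].append(response)
        # Average repeated measurements in the same bin within one experiment.
        for b, values in within.items():
            per_bin[b].append(mean(values))

    curve = []
    for b in sorted(per_bin):
        values = per_bin[b]
        # s.e.m. across experiments; undefined for a single experiment.
        if len(values) > 1:
            sem = stdev(values) / math.sqrt(len(values))
        else:
            sem = float("nan")
        curve.append(((b + 0.5) * bin_width, mean(values), sem))
    return curve


# Hypothetical data: two experiments with slightly different temperatures.
example = [
    [(0.10, 2.0), (0.20, 4.0), (0.30, 10.0)],
    [(0.15, 5.0), (-0.10, -1.0)],
]
curve = sensitivity_curve(example)
```

Averaging within each experiment first keeps one value per cell in every bin, so the s.e.m. reflects cell-to-cell variability rather than trial count.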

Temperature measurements. Temperature measurements at the location of the 
antenna were made using K-type thermocouples with a 75-100 µm tip (Omega Engineering,
CHAL-001 thermocouple and SMPW-CC-K-M connector). One wire of
the thermocouple was threaded into a microbore Teflon tube (Cole-Parmer, EW- 
06417-11) to provide mechanical rigidity and insulation between the two thermo- 
couple wires. The thermocouple was mounted on a micromanipulator and its tip 
was positioned entirely within the saline jet, which included 500 µM phenol red for 
visualization. Thermocouple measurements were acquired and digitized at 99 Hz 
by a National Instruments USB-9213 Thermocouple Measurement Device. Temperature
measurements displayed in the figures represent the mean of several experiments
with the same commands to the stimulus device; because these measurements
were highly consistent across experiments (generally within 1 °C of the mean) we 
did not measure the temperature in every experimental replicate. 
Histochemistry. In some experiments (noted above), the morphology of the recorded 
neuron was visualized after recording by incubating the brain with a fluorescent 
conjugate of streptavidin, as published previously*’. Immunohistochemistry was 
performed as described previously~’. Primary antibodies were obtained from the 
following sources (with dilutions in parentheses): mouse nc82 from the Developmental
Studies Hybridoma Bank (nc82-s, 1:50), rat anti-CD8 from Invitrogen
(MCD0800, 1:200), rabbit anti-GABA from Sigma (A2052, 1:200), rabbit anti-dVGluT
(1:500; gift of A. DiAntonio, ref. 19). Secondary antibodies (Invitrogen)
were used at a 1:250 dilution. To reconstruct neuronal morphology from biocytin 
fills, we hand-traced the skeletonized morphology using the Simple Neurite Tracer 
plugin in Fiji, using the Fill Out command to automatically generate a 3D volume, 
which we subsequently converted to a z projection. The morphology of the slow- 
cool-PN has been described previously”*. 

Calcium imaging. Cool and warm cells were imaged in separate experiments. 
Widefield illumination was generated with a 100 W mercury arc lamp (Olympus) 
attenuated with a ND-25 neutral-density filter, band-pass filtered at 540-580 nm, 
and delivered to the specimen through a 40× water-immersion objective mounted 
in an upright compound microscope (Olympus BX51WIF). Phenol red was omitted 
from the saline jet during these experiments. Images were acquired using a sCMOS 
camera (Hamamatsu ORCA-Flash4.0 V2) with a 50 ms exposure time. Thermal 
stimuli were delivered in fast mode (that is, with a rise time of 100 ms or less). Data 
was analysed using custom MATLAB software. Calcium imaging confirmed that 
Gr28b.d-Gal4 drives expression in warm thermoreceptor cells in the arista, whereas 
GMR79C04-Gal4 drives expression in cool thermoreceptor cells in the arista (Ex- 
tended Data Fig. 6). 

Histamine iontophoresis. For histamine iontophoresis, a high-resistance (~100 MΩ) 
sharp pulled glass micropipette was filled with a solution of 1 M histamine dihy- 
drochloride in saline. The pipette was inserted into the ipsilateral third antennal 
segment, and histamine was ejected using a 0.5-3 s pulse of positive current (300 nA) 
applied with an iontophoresis current generator (Model 260, World Precision 
Instruments) gated by a TTL pulse. Histamine ejection was initiated 4-5 s before 
the onset of the thermal stimulus. A constant negative backing current of 40 nA 
was applied to retain histamine in the pipette between ejections. These ejection 
parameters were chosen such that there was essentially no effect of histamine ion- 
tophoresis in control genotypes that lacked the LexA transgene (Extended Data 
Fig. 7), and the brief electrical artefact produced by the iontophoresis current pulse 
was small. Although some thermosensory PNs arbourize bilaterally, in pilot exper- 
iments where we removed the contralateral antenna we observed no clear alteration 
in the thermal responses of these cells, and so we performed histamine iontophor- 
esis only in the ipsilateral antenna. We chose to iontophorese histamine into the 
antenna in these experiments because in pilot studies we found that iontophores- 
ing histamine into the brain had effects on the central neurons that did not depend 
on the LexAOp-HA-Ort transgene. 

We also created a LexA line designed to specifically drive expression in the cool
thermoreceptor cells by converting the GMR79C04-Gal4 line to a LexA line using
established methods”, and we used this to drive expression of the histamine 
receptor in the cool thermoreceptor cells. However, when we attempted to silence 
the cool cells in this manner, we did not substantially alter the thermal responses of 
fast-cool-PNs, which clearly receive almost all their thermal input from the arista. 
The failure of this positive control is likely due to low transgene expression levels in 
the cool thermoreceptor cells, because GFP signals driven by this LexA line were 
also weak. 
Data analysis. Reported numbers of replicates (n values) always represent numbers 
of cells recorded; in almost all cases, only one cell was recorded per brain. Centre 
values represent the mean and error bars represent s.e.m. computed across cells 
(experiments). No formal statistical calculations were used to pre-determine sample 
sizes. Sample sizes were qualitatively governed by the observed cell-to-cell reliability 
in the measurements we performed in pilot experiments. Our sample sizes are 
similar to those generally employed in the field. Data distribution was assumed to 
be normal with similar variance between the groups that were compared statist- 
ically, but this was not formally tested. All statistical tests were two-sided. In no 
case were flies from the same genotype assigned to distinct experimental treatments, 
and so no randomization procedure was needed. No blinding was performed dur- 
ing experiments or analysis. 
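
As a small illustration of the summary statistics described above, the following Python sketch computes a mean and s.e.m. across cells, treating each cell as one replicate. The function name and the example firing rates are hypothetical and are not taken from the study.

```python
import math
from statistics import mean, stdev


def mean_and_sem(per_cell_values):
    """Return (mean, s.e.m.) with one value per recorded cell (n = cells)."""
    n = len(per_cell_values)
    if n < 2:
        raise ValueError("need at least two cells to estimate s.e.m.")
    # s.e.m. = sample standard deviation / sqrt(n)
    return mean(per_cell_values), stdev(per_cell_values) / math.sqrt(n)


# Hypothetical per-cell firing rates (Hz), one value per cell.
firing_rates_hz = [12.0, 15.0, 9.0, 14.0]
centre, sem = mean_and_sem(firing_rates_hz)
print(f"{centre:.2f} +/- {sem:.2f} Hz (n = {len(firing_rates_hz)} cells)")
```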

Spikes were detected using custom software in Igor Pro. Peri-stimulus time his- 
tograms were generated by calculating the firing rate in 50 ms bins that overlapped 
by 25 ms. Data for each cell represents an average of at least 4 trials of the same 
stimulus for electrophysiology experiments, and an average of 3 trials for calcium 
imaging experiments. All error bars represent s.e.m. computed across cells, as do 
the shaded bands in the peri-stimulus time histograms. For calcium imaging exper- 
iments, fluorescence was measured within a region of interest near the base of the 
arista that was drawn so as to contain the somata of 1-2 GCaMP-expressing cells in 
the same focal plane. 
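
The peri-stimulus time histogram described above (50 ms bins overlapping by 25 ms, averaged over repeated trials) can be sketched in Python as follows. The original analysis used custom Igor Pro software, so this is only an illustration under stated assumptions: the function name, the millisecond time base and the example spike times are all hypothetical.

```python
def psth(trials, t_start_ms, t_stop_ms, bin_width_ms=50, step_ms=25):
    """Trial-averaged firing rate in overlapping bins.

    trials: list of trials, each a list of spike times in ms
            (hypothetical layout).
    A 50 ms bin advanced in 25 ms steps gives bins that overlap by 25 ms.
    Returns (bin_centres_ms, rates_hz).
    """
    n_bins = (t_stop_ms - t_start_ms - bin_width_ms) // step_ms + 1
    centres, rates = [], []
    for i in range(n_bins):
        lo = t_start_ms + i * step_ms
        hi = lo + bin_width_ms
        # Count spikes from all trials that fall in [lo, hi).
        count = sum(1 for spikes in trials for t in spikes if lo <= t < hi)
        centres.append(lo + bin_width_ms / 2)
        # spikes per trial per second
        rates.append(count * 1000 / (len(trials) * bin_width_ms))
    return centres, rates


# Hypothetical spike times (ms) from two trials of the same stimulus.
example_trials = [[10, 30, 60], [20]]
centres, rates = psth(example_trials, t_start_ms=0, t_stop_ms=100)
```

Because adjacent bins share half their spikes, neighbouring points in the histogram are not independent; the overlap smooths the trace without changing the bin width.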

To compare responses to large temperature steps, we counted spikes in a 2 s period 
after step onset. To compare the sensitivity of cells to small temperature steps, we 
measured both mean temperature and mean neural activity (either firing rate or 
membrane potential) in a time window 100-400 ms after opening the valve for all 
cells except the warm-cool-PN, where we used a window of 100-200 ms after valve 
opening. Where we measured the mean membrane potential within a time window, 
we first low-pass filtered the membrane potential at 10 Hz to remove spikes. Because 
our device does not permit forward control of temperature with sub-degree pre- 
cision, the exact temperatures we used were slightly different in each experiment. 
We therefore binned the data for the sensitivity analyses according to the measured 
temperature in each trial, averaging together different measurements in the same 
temperature bin within the same experiment. For each temperature bin we then 
computed the mean neural activity across experiments, and also the s.e.m. across 
experiments to generate the sensitivity plots; the x-axis value of each data point in 
the sensitivity plots represents the middle of each temperature bin. In order to 
perform tests of statistical significance on the sensitivity data, we used an iterative 
approach to avoid a multiple comparisons error. There is good reason to think there 
might be a true difference for small temperature steps that would not be evident for 
larger temperature steps, but we did not know a priori what temperature changes 
these neurons might be sensitive to. Therefore, for a given pair of genotypes and a 
given stimulus polarity (warming or cooling), we first compared responses to the 
smallest temperature step (0.125 °C) using an unpaired t-test. We then adjusted the 
threshold for significance (α) to reflect the cumulative number of tests performed
(n) using a Bonferroni correction (α = 0.05/n) and proceeded to test the adjacent
temperature step (0.425 °C), and so on. This procedure maximized our statistical
power for the smallest steps. Because statistical power decreased for increasingly 
larger steps, this procedure is appropriate for identifying the smallest step where a 
difference might occur, but not for identifying the entire temperature range over 
which there might be a difference. 
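
The iterative threshold adjustment described above can be sketched in Python. This is one reading of the procedure, not the authors' code: the nth test in the sequence (ordered from the smallest temperature step upwards) is compared against α = 0.05/n. The function name and the p-values are hypothetical; in practice each p-value would come from an unpaired t-test between the two genotypes.

```python
def iterative_bonferroni(p_values, base_alpha=0.05):
    """Apply a cumulative Bonferroni threshold to an ordered list of tests.

    p_values: p-values ordered from the smallest temperature step upwards
              (hypothetical inputs).
    Returns a list of (test_number, alpha, significant) tuples, where the
    nth test uses alpha = base_alpha / n.
    """
    results = []
    for n, p in enumerate(p_values, start=1):
        alpha_n = base_alpha / n
        results.append((n, alpha_n, p < alpha_n))
    return results


# Hypothetical p-values for three successive temperature steps.
outcome = iterative_bonferroni([0.03, 0.03, 0.01])
for n, alpha_n, significant in outcome:
    print(f"test {n}: alpha = {alpha_n:.4f}, significant = {significant}")
```

The same p-value can pass at the first step and fail at a later one, which is why the procedure favours detection at the smallest steps.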
Extended Data Figure 1 | Jet micro-thermode schematic. a, Schematic
showing the major components of the jet micro-thermode. A pressurized
reservoir forces saline through the inner chamber of a heat exchanger. The
saline is then expelled as a jet through a 160 µm nozzle. This jet can be quickly
turned on and off via a two-way solenoid valve (the ‘fast’ valve). The outer
chamber of the heat exchanger is fed by two pumps which circulate water from
a hot and a cold water bath. Two three-way solenoid valves control the flow of
water into the exchanger. A third valve (not shown) serves to return the outflow
of the heat exchanger to the same bath that is feeding it. b, c, In hot mode 
and cold mode the heat exchanger is heated or cooled by the continual 
circulation of water from one of the two baths through the outer chamber. 

d, In the no-flow configuration, circulation in the outer chamber is stopped. 
This keeps the heat exchanger at a fairly constant temperature, stabilized by the 
thermal mass of the water in the outer chamber. 
Extended Data Figure 2 | Morphologies of neurons in the proximal 
antennal protocerebrum. Images are z projections of coronal sections (dorsal 
is up). The soma remains sealed to the patch pipette at the end of each 
recording, and so the soma and the proximal part of the primary neurite are not 
present in the reconstructed morphologies; they are indicated schematically 
(soma with a black ball and primary neurite with a thick black line). 
Morphologies shown here were representative of all filled examples of the same 
neuron type (except as noted below). Regions are colour-coded according to the 
key in f. a, A fast-cool-PN. These neurons have dendrites in the proximal 
antennal protocerebrum and axons projecting to two regions of the lateral 
protocerebrum (posterior lateral protocerebrum and posterior slope). Some 
fast-cool-PNs arbourized unilaterally (rather than bilaterally) in the proximal 
antennal protocerebrum; we pooled data from both morphologies because their 
physiological properties were indistinguishable. b, A slow-cool-PN. These 
neurons have dendrites in the ipsilateral proximal antennal protocerebrum and 
axons projecting to a subregion of the calyx of the mushroom body. This 
morphology has been described previously”® (that study called this neuron a 
‘transverse antennal lobe projection neuron’). c, A warm-PN. These neurons 
have dendrites in the ipsilateral proximal antennal protocerebrum and axons 
projecting to two regions of the lateral protocerebrum (posterior lateral 
protocerebrum and posterior slope). d, A warm-cool-PN. These neurons 
have dendrites in both the ipsi- and contralateral proximal antennal 
protocerebrum, as well as the ventral-posterior margin of the antennal lobe. 
Their axons project to two regions of the lateral protocerebrum (posterior 
lateral protocerebrum and posterior slope). e, Two GABAergic LNs. All LNs we 
encountered in this study were bilateral. We define an LN here as a neuron that 
does not extend processes into any region except the proximal antennal 
protocerebrum (and the midline commissure that links the right and left copies 
of this brain region). Most LNs arbourized broadly within the posterior 
antennal protocerebrum and so are well-positioned to mediate cross-talk 
between warm and cool pathways. f, Locations of colour-coded brain regions in 
a coronal section through the brain. 
Extended Data Figure 3 | Adaptation to small temperature steps.
a-d, Sensitivity to small temperature steps, shown as the mean change in firing
rate versus the change in temperature, averaged across experiments, ±s.e.m.
(n = 5-9). Data in red are measured during a late period after stimulus
onset (from 4 to 5 s after valve opening). Data in black are measured during an
early period after stimulus onset (from 100 to 400 ms after valve opening,
except for the warm-cool-PNs, where it was from 100 to 200 ms). Data in black
are reproduced from Fig. 1c (a, fast-cool-PNs), Fig. 1g (b, slow-cool-PNs),
Fig. 2g (c, warm-PNs) and Extended Data Fig. 8 (d, warm-cool-PNs). Note that
slow-cool-PNs show little adaptation to large temperature steps (Fig. 1).
However, their responses to small steps adapt strongly, like those of the other
PN types.
Extended Data Figure 4 | Innervation patterns in the proximal antennal 
protocerebrum. Images are single coronal confocal sections through the 
proximal antennal protocerebrum, except where noted below. Magenta is 
neuropil (nc82 immunofluorescence). The approximate boundary of the 
proximal antennal protocerebrum is indicated in white outline. a, Axon 
terminals of cold peripheral neurons and warm peripheral neurons. GFP 
expression is driven by specific Gal4 lines (GMR79C04-Gal4 for the cool cells, 
Gr28b.d-Gal4 for the warm cells). As noted previously’, the cool peripheral cells 
project to the lateral part of this brain region, whereas the warm peripheral 
cells project more medially. Neuropils surrounding the proximal antennal 
protocerebrum are labelled as LP (lateral protocerebrum), LAL (lateral 
accessory lobe) and SOG (subesophageal ganglion). The schematic on the right 
shows a coronal section of the whole brain with the location of the proximal 
antennal protocerebrum indicated by white outlines. b, Dendrites of three types 
of projection neurons. Each cell was filled with biocytin and visualized using 
a fluorescent streptavidin conjugate. The dendrites of the fast-cool-PNs reside 
in the lateral part of the proximal antennal protocerebrum. The same is true of 
the dendrites of the slow-cool-PNs (data not shown). The dendrites of the 
warm-PNs overlap with the axons of both warm and cool peripheral neurons. 
The dendrites of warm-cool-PNs are sparser than those of other PN types, so in 
a single section only scattered fragments of dendrite are visible; therefore a z 
projection through the entire proximal antennal protocerebrum is shown in the 
image below. 
Extended Data Figure 5 | Circuit contributions to cool-PN responses.
a-d, Fast-cool-PNs. a, Mean firing rate for the fast-cool-PNs, ±s.e.m.
Reproduced from Fig. 1b. b, Mean firing rate with synaptic inhibition blocked
with picrotoxin and CGP54626 (n = 4-5). c, Mean firing rate in the Gr28b.d
mutant (n = 5-10). d, Mean firing rate in the Gr28b.d mutant with inhibition
blocked (n = 4-6). e-h, Slow-cool-PNs. e, Mean firing rate for the slow-cool-PNs,
±s.e.m. Reproduced from Fig. 1f. f, Mean firing rate with synaptic
inhibition blocked with picrotoxin and CGP54626 (n = 4-5). g, Mean firing
rate in the Gr28b.d mutant (n = 4-5). h, Mean firing rate in the Gr28b.d mutant
with inhibition blocked (n = 4). In the Gr28b.d mutant, note that blocking
inhibition produces only modest disinhibition in the cool-PNs. This contrasts
with our results in the warm-PNs and warm-cool-PNs, where blocking
inhibition in the mutant abolished all excitation evoked by the preferred
stimulus for these neurons (Fig. 2 and Fig. 3). Genotypes are:
a-d, Gr28bMB03888;GMR95C02-Gal4,pJFRC2-10XUAS-IVS-mCD8::GFP and
e-h, Gr28bMB03888;GMR67D03-Gal4,pJFRC2-10XUAS-IVS-mCD8::GFP.
Extended Data Figure 6 | Validation of the LexA line for warm peripheral 
cells and the Gal4 line for cool peripheral cells. There are six thermoreceptor 
cells at the base of the arista, comprising three warm cells and three cool 
cells”. Gr28b.d-Gal4 labels the three warm cells*. a, b, Validation of the Gr28b.d- 
LexA line for warm peripheral cells. a, We generated a LexA line using a 
Gr28b.d promoter fragment. Here we show that Gr28b.d-LexA labels three cells 
at the base of the arista. This image is a z projection of a confocal stack 
through the base of the arista. Genotype is Gr28b.d-LexA/+;26XLexAop2- 
mCD8::GFP/+. b, Axon terminals in the brain of peripheral neurons labelled by 
Gr28b.d-LexA (left) and Gr28b.d-Gal4 (right), both crossed with appropriate 
CD8::GFP reporter lines. As expected, the medial portion of the proximal 
antennal protocerebrum (outlined in white) is labelled with GFP. Images are 
single coronal confocal sections through the proximal antennal protocerebrum. 
Magenta is neuropil (nc82 immunofluorescence). Image on the right is 
reproduced from Extended Data Fig. 4. c, d, Validation of the GMR79C04-Gal4 
line for cool peripheral cells. c, GMR79C04-Gal4 labels three cells at the base 
of the arista. Combining this driver with Gr28b.d-Gal4 labels six cells, 
indicating that these drivers label mutually exclusive populations of aristal 
neurons. These images show z projections of confocal stacks through the base of 
the arista. Genotypes are Gr28b.d-Gal4/+;UAS-nls-GFP/+ (left), UAS-nls- 
GFP/+;GMR79C04-Gal4/+ (middle), Gr28b.d-Gal4/+;UAS-nls-GFP/ 
GMR79C04-Gal4 (right). d, Calcium imaging of neurons at the base of the arista 
shows that Gr28b.d-Gal4 labels cells that are excited by warming and inhibited 
by cooling, whereas GMR79C04-Gal4 labels cells that are excited by cooling 
and inhibited by warming. Shown here are representative experiments using 
the ‘large fast step’ stimuli. Genotypes are 20XUAS-GCaMP3/+;Gr28b.d-Gal4/+
(warm cell imaging) and 20XUAS-GCaMP3/+;;GMR79C04-Gal4/+
(cool cell imaging). Note that Fig. 4g shows data for all experiments using ‘small
fast step’ stimuli. 
Extended Data Figure 7 | Histamine injection controls. In these experiments
the LexA driver (Gr28b.d-LexA) was omitted, but otherwise the experimental
protocol was the same as in Fig. 2e, f and Fig. 3e-g. After histamine was
injected into the antenna, there was little effect on the thermal responses of
warm-PNs or warm-cool-PNs. a, Recordings from warm-PNs (n = 5).
Genotype is LexAOp-HA-Ort/+;GMR95C02-Gal4,pJFRC2-10XUAS-IVS-mCD8::GFP.
b, Recordings from warm-cool-PNs (n = 4). Genotype is
LexAOp-HA-Ort/+;GMR54A03-Gal4,pJFRC2-10XUAS-IVS-mCD8::GFP.
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Extended Data Figure 8 | Sensitivity to small temperature steps in 
warm-cool-PNs. Sensitivity to small temperature steps, shown as the mean 
change in firing rate versus the change in temperature, averaged across 
experiments, +s.e.m. (n = 5-9). Firing rate changes were averaged over a 
window from 100 ms to 200 ms after valve opening. Sensitivity was not 
significantly different in wild type versus the Gr28b.d mutant (unpaired t-tests 
with iterative Bonferroni corrections, see Methods). Blocking inhibition in 
these cells produced oscillatory activity that precluded analysis of responses to 
small steps with inhibition blocked. Genotypes are: GMR54A03-Gal4,pJFRC2-
10XUAS-IVS-mCD8::GFP (wild type) and Gr28bMB03888;GMR54A03-
Gal4,pJFRC2-10XUAS-IVS-mCD8::GFP (Gr28b.d−/−).
Extended Data Figure 9 | Validating the Gad1-Gal4 line as a marker of
GABAergic local neurons in the posterior antennal protocerebrum. In pilot 
studies, we found that all the local neurons we encountered in the proximal 
antennal protocerebrum had somata within a distinctive cluster ventral to the 
antennal lobes. We therefore targeted our electrodes to this cluster when we 
used a Gad1-Gal4 line” to drive CD8::GFP expression in putative GABAergic 
neurons. In order to determine if the CD8::GFP-expressing somata in this
cluster are indeed GABAergic, we performed dual immunofluorescence
confocal microscopy with anti-CD8 and anti-GABA antibodies. This
representative image shows that almost all CD8::GFP-expressing somata in this
region are GABA-immunopositive (90% overall), although not all GABAergic
somata express CD8::GFP. This image is a single coronal section through
the cluster of somata.
Extended Data Figure 10 | Additional examples of LNs in the proximal
antennal protocerebrum. For each LN, the sequence of panels is analogous to 
that of Fig. 4: raw traces (a, e), peri-stimulus time histograms (b, f) and 
sensitivity plots (c, g). The morphologies of these cells (d and h) are displayed as 
in Extended Data Fig. 2, with a red outline indicating the boundary of the 
proximal antennal protocerebrum. a–d, A GABAergic LN. This LN is excited
by warming and is inhibited by cooling. This cell was recorded in the genotype
pJFRC7-20XUAS-IVS-mCD8::GFP/+;Gad1-Gal4/+. e–h, A glutamatergic
LN. This LN is excited by cooling and is inhibited by warming, and its 
properties are similar to those of all the glutamatergic LNs we recorded from. 
In total we recorded from six glutamatergic LNs, three in the genotype 
GMR52G03-Gal4,pJFRC2-10XUAS-IVS-mCD8::GFP and three in the genotype 
GMR91H10-Gal4,pJFRC2-10XUAS-IVS-mCD8::GFP.

Temperature representation in the Drosophila brain


Dominic D. Frank, Genevieve C. Jouandet, Patrick J. Kearney, Lindsey J. Macpherson & Marco Gallio


In Drosophila, rapid temperature changes are detected at the peri- 
phery by dedicated receptors forming a simple sensory map for hot 
and cold in the brain’. However, flies show a host of complex innate 
and learned responses to temperature, indicating that they are able 
to extract a range of information from this simple input. Here we 
define the anatomical and physiological repertoire for temperature 
representation in the Drosophila brain. First, we use a photolabelling 
strategy’ to trace the connections that relay peripheral thermosen- 
sory information to higher brain centres, and show that they largely 
converge onto three target regions: the mushroom body, the lateral 
horn (both of which are well known centres for sensory processing) 
and the posterior lateral protocerebrum, a region we now define as 
a major site of thermosensory representation. Next, using in vivo 
calcium imaging’, we describe the thermosensory projection neurons 
selectively activated by hot or cold stimuli. Fast-adapting neurons 
display transient ON and OFF responses and track rapid tempera- 
ture shifts remarkably well, while slow-adapting cell responses better 
reflect the magnitude of simple thermal changes. Unexpectedly, we 
also find a population of broadly tuned cells that respond to both 
heating and cooling, and show that they are required for normal 
behavioural avoidance of both hot and cold in a simple two-choice 
temperature preference assay. Taken together, our results uncover 
a coordinated ensemble of neural responses to temperature in the 
Drosophila brain, demonstrate that a broadly tuned thermal line 
contributes to rapid avoidance behaviour, and illustrate how stim- 
ulus quality, temporal structure, and intensity can be extracted from 
a simple glomerular map at a single synaptic station. 
Thermosensation provides animals with critical information about 
their environment. We have previously shown that, in Drosophila, rapid 
temperature changes are detected by dedicated hot and cold temper- 
ature receptors in the last antennal segment, the arista, with three cells 
responding to warming and three to cooling’. The projections of these 
sensory neurons target the base of the antennal lobe, a region previously 
referred to as the proximal antennal protocerebrum (recent Drosophila 
brain nomenclature’ refers to this region as the posterior antennal lobe). 
At the posterior antennal lobe, hot and cold temperature receptors form 
two distinct, adjacent glomeruli defining a simple map for temperature 
coding’. Here we focus on identifying the second-order thermosensory 
neurons (referred to as thermosensory projection neurons) as a first step
towards understanding how this spatially segregated pattern of activity 
is processed to mediate appropriate responses to temperature stimuli. 
Thermosensory projection neurons (tPNs) should display prom- 
inent dendritic posterior antennal lobe (PAL; see Methods) innervation. 
Therefore, we reasoned that two-photon guided conversion of photo- 
activatable GFP (PA-GFP**) could provide a suitable approach to identify 
their projections. We engineered flies where PA-GFP was constitutively 
expressed in all brain neurons, and the hot and/or cold temperature 
receptor terminals were selectively labelled by a red fluorescent protein® 
(Extended Data Fig. 1). Using the red fluorescence as a guide, we then 
photo-converted PA-GFP in either the hot (Fig. 1a) or the cold (Fig. 1b) 
glomerulus, resulting in labelling of ascending axons. Projection neurons 
originating in the PAL follow several widely different trajectories to reach 
their targets (Fig. 1). Among these, one prominent pathway targeted 
the calyx (the input region of the mushroom body) and the edge of the 


lateral horn, from where a subset of axons further extended ventrally 
to terminate in the posterior lateral protocerebrum (PLP, Fig. 1a, b, top 
arrowhead). Another group of tPNs initially followed the same ascend- 
ing tract, but bypassed the calyx, instead veering off laterally to innerv- 
ate the same domains of the lateral horn and PLP (Fig. 1a, b, centre 
arrowhead). A third class of tPNs directly extended axons laterally, fol- 
lowing either an anterior or posterior path that still terminated in the 
same domains of the lateral horn and PLP (Fig. 1a, b, bottom arrow- 
head). Thus, despite following different routes, these pathways largely 
converged onto just three regions: the mushroom body calyx, the lateral 
horn (well-known centres for sensory processing in the fly brain) and 
the PLP, a poorly characterized region that now emerges as a major site 
of thermosensory signal processing. Our data also demonstrated sig- 
nificant overlap between projections innervating the hot and cold glo- 
meruli. Indeed, we only observed a single modality-specific fibre that 
connected the cold sensory terminals to a ‘microglomerulus’ at the edge 
of the calyx (Fig. 1b, open arrowhead; this unusual fibre has been noted 
previously in anatomical studies’). 

These findings raise the question of why thermosensory information 
is split into different pathways, which largely converge onto the same 
target regions. To address this question, we searched the large collec- 
tions of Gal4 driver lines made available by the Janelia Farm FlyLight 
initiative’, as well as the ViennaTile project (http://stockcenter.vdrc.at/), 
and selected lines to target specific tPN populations for functional 


Figure 1 | Projection neuron pathways in the fly thermosensory system.
a, b, PA-GFP photoconversion directed to the pre-synaptic termini of hot
(a, red circle) or cold (b, blue circle) temperature receptors labels post-synaptic
pathways, revealing common projections (filled arrowheads in a and b)
and a single, cold-specific fibre (empty arrowhead in b), as well as a high degree
of convergence on three target regions: the mushroom body (MB), lateral
horn (LH) and posterior lateral protocerebrum (PLP). Mushroom bodies
were labelled with dsRed as a landmark. c, d, f, h, j, Three-dimensional
reconstruction of specific pathways. e, g, i, k, Two-photon stacks of brains
expressing GFP under the control of drivers selectively expressed in
corresponding tPNs (see Methods for genotypes). Drivers: e, R22C06;
g, R84E08; i, R60H12; k, R95C02 (cell bodies are boxed). ALT, antennal lobe
tract; l, lateral; m, medial; t, transverse. Scale bars, 25 μm.

studies. Visual inspection of nearly 12,000 confocal stacks (represent-
ing more than 10,000 lines) confirmed that our PA-GFP experiments
had identified the major projections originating from the PAL and pro- 
vided an initial list of ~200 drivers targeting tPNs. For this work, we used 
ten representative drivers fulfilling three important criteria: (1) select- 
ive Gal4 expression, as this would help to identify unambiguously the 
neurons of interest; (2) confirmed synaptic connectivity with the sens- 
ory neurons, as detected by GFP reconstitution across synaptic partners 
(GRASP (refs 8, 9), including a new synaptic variant we developed; Extended
Data Fig. 2); (3) rapid functional responses to naturalistic temperature 
stimuli, recorded in vivo by gently exposing brain tissue through a hole 
in the head cuticle and imaging the activity of tPNs by two-photon mi- 
croscopy’. Finally, we used acute resection of the antennal nerve as a 
means to confirm that the cell’s responses were in fact driven by the 
antennal temperature receptors. 

The drivers listed in Extended Data Table 1 fulfil all these criteria 
and provide a comprehensive repertoire of thermosensory projection 
neurons, while the anatomy of a representative set of tPN cell types 
(reconstructed by transgenic labelling with GFP) is shown in Fig. 1c-k. 
Finally, we confirmed that all identified tPNs displayed the expected 
polarity of a projection neuron (that is, dendrites in the PAL and axon 
terminals in higher brain centres) by targeting expression of a dendritic
marker (DenMark, ref. 10; Extended Data Table 1) and of a pre-synaptic GFP
fusion (syt-GFP, ref. 11; Extended Data Fig. 3). In all, our screen identified
seven tPN cell types with distinct innervation patterns and functional 
properties (see below). 

Thermoreceptor neurons in the antenna respond either to cooling or 
heating and define ‘labelled lines’ for temperature coding at the peri- 
phery’. Functional imaging studies revealed second-order neurons that 
were also selectively activated by either cooling or heating (that is, ‘nar- 
rowly tuned’), and specifically connected to either the cold or hot tem- 
perature receptors (as demonstrated by GRASP, Extended Data Fig. 2 
and Extended Data Table 1). For example, robust, sensitive responses 
to cooling were reliably observed from neurons innervating the cold-
specific fifth transverse antennal lobe tract (t5ALT) (Fig. 2) and display-
ing selective GRASP with cold temperature receptors (Extended Data
Fig. 2, R60H12), while we recorded robust heating responses from cells 
innervating the lateral ALT pathway and selectively connecting with 
hot temperature receptors (VT46265; a full description of the properties 
of the various cell types is provided in Extended Data Table 1). 

Narrowly tuned projection neurons could be categorized based on 
the decay profile of their calcium responses as either ‘slow-adapting’ or 
‘fast-adapting’. Slow-adapting tPNs, such as the cold-specific t5ALT tPN,
responded to temperature stimuli with calcium transients that persisted 
during the stimulus and even after the temperature had returned to 
baseline (Fig. 2b, arrowheads). As shown in Fig. 2d, the peak responses 
of this cell type scaled with the magnitude of cooling stimuli over a wide 
range of intensities. Yet, as a consequence of slow decay, intracellular 
calcium did not return to baseline when cooling stimuli were rapidly 
interleaved (Fig. 2e). 

In contrast, fast-adapting cells responded to temperature changes with 
a calcium transient that did not faithfully scale with stimulus intensity 
and which was followed by fast decay, as illustrated in Fig. 3 for a hot 
tPN innervating the lateral pathway (Fig. 3a–d; see Extended Data Fig. 4
for a comparison of fast- and slow-adapting cold cells). As a result of 
fast kinetics, the peak response of this cell type generally preceded the 
stimulus peak (Fig. 3d) and, for larger stimuli, intracellular calcium had 
nearly returned to the pre-stimulus baseline when the temperature was 
still rapidly changing (Fig. 3c). Because of this, these fast-adapting cells 
are unlikely to code information regarding the peak temperature of the 
stimulus (Fig. 3e), yet they were able to track remarkably well a rapidly 
evolving temperature transient (Fig. 3f). 

One of the drivers that we identified is active in a group of six such 
fast-adapting neurons, four of which are activated by cooling and two 
by heating, allowing one to simultaneously record the responses of both 
cell types under two-photon microscopy. Our ‘hot’ stimuli consist of a

Figure 2 | Properties of slow-adapting, cold-activated projection neurons.
a, Three-dimensional reconstruction of a tPN that innervates the t5AL tract
(expressing R60H12; cell body in inset). b, c, Representative calcium-imaging
traces from this tPN. Robust transients are seen in response to cooling
stimuli of increasing magnitude (b), but not in response to heating (c). Notably,
intracellular calcium concentration ([Ca²⁺]ᵢ) remains above baseline well
after the end of cooling (arrowheads in b). d, Stimulus-response plot. Blue dots,
peak response of a single cell to a defined stimulus, connected dots belong to
the same cell, n = 5 cells/5 animals. Grey dots, binned averages ± s.e.m.;
bin width = 2.5 °C, bin at zero = 0 °C ΔT. Responses are proportional to
cold stimuli (R² = 0.5, P < 0.001; slope = −0.43, 95% confidence interval
(CI) = [−0.61 to −0.25]) but not to hot stimuli (R² P = not significant).
e, [Ca²⁺]ᵢ does not return to baseline when cooling stimuli are rapidly
interleaved with heating ones (black trace = average of 3 responses, 3 cells/3
animals ± s.e.m., in grey; blue trace = average of the 3 stimuli ± s.e.m., grey).
In all panels ΔF/F indicates fluorescence change over baseline; ΔT indicates
temperature change. Scale bar in a, 5 μm.




heating pulse followed by cooling, which quickly brings the temperature 
back to baseline. As expected, we observed a transient calcium response 
in the hot-activated cell type at the beginning of the heating step (Fig. 3g-i, 
‘ON’ response). Interestingly, the cold-activated cell type did not imme- 
diately respond at the onset of the following cooling phase (as would be 
expected for a simple cooling response), but rather with a significant 
delay; that is, at the very end of the temperature transient when the tem-
perature was again approaching baseline (‘OFF’ response, Fig. 3i). Even
in the midst of a rapidly varying temperature transient, ON and OFF 
cell responses alternated, with hot cells tracking the onset of heating, 
and cold cells becoming active when the temperature approached the 
25 °C baseline (Extended Data Fig. 5). 

We note that the specific delay of OFF responses was observed only 
when cooling was preceded by heating, perhaps reflecting inhibitory 
interactions between hot and cold processing circuits. Furthermore, OFF 
responses coincided with the return to baseline even when this was sys- 
tematically varied between ~20 °C and ~30 °C (Extended Data Fig. 6). 
Hence, OFF responses appear to demarcate return to baseline after a 
rapid change, and might for example help the animal maintain course on 
a defined thermocline. Hot-activated projection neurons also displayed 
OFF responses during the heating phase that followed a cooling stimulus, 
but the amount of delay varied between cells (Extended Data Fig. 6). 

Taken together, our results suggest that the distinct kinetics of slow- 
and fast-adapting tPNs may tune each class to relay significantly different

Figure 3 | Fast-adapting projection neurons
display ON and OFF responses to temperature
stimuli. a, Fast-adapting tPNs responding to either
hot or cold innervate a lateral pathway (three-
dimensional reconstruction in green). b, c, A single
cell in this cluster (expressing R95C02) displays
rapid but transient responses to hot stimuli of
different amplitude. Note that, for the larger
stimulus, the response has already nearly returned
to baseline while the temperature is still changing.
d, Box-plot showing the temporal difference
between response and stimulus peak (tye) and
onset (fo), respectively. Owing to its transient
nature, the response often peaks before the
stimulus (red line, median; blue box, 25th and 75th
percentiles; whiskers, data range; n = 15 stimuli,
ΔT = 3.4 ± 0.9, mean ± s.d.; 5 cells/5 animals).
e, Stimulus-response plot for defined heating
stimuli (green dots, peak response of a single cell to
a stimulus, connected dots belong to the same cell,
n = 5 cells/5 animals; grey dots, binned
averages ± s.e.m.; bin width = 2.5 °C, bin at
zero = 0 °C ΔT). Responses of this cell type do not
correlate with stimulus intensity (R² P = not
significant). f, This cell type tracks well a dynamic
temperature stimulus (black trace, average of 3
responses, 3 cells/3 animals ± s.e.m., in grey; red
trace, average of the 3 stimuli ± s.e.m., grey).
g–i, The driver R95C02 is expressed in both hot-
(green arrow in g and traces in i) and cold-activated
tPNs (purple arrow in g and traces in i). h, A
sequence of ΔF/F images of two cell bodies
responding to a hot stimulus (ΔT ~ 5 °C, peaking
at 0 s, same cells as in g); i, single stimulus and
response traces from the two cell bodies in
g, recorded simultaneously. i, The hot activated
tPN responds to a heating stimulus with a rapid
ON transient, while the cold cell displays a delayed
OFF response. Scale bars in b and g, 5 μm.


Figure 4 | Broadly tuned thermosensory projection neurons respond to
both heating and cooling stimuli. a, Three-dimensional reconstruction of a
broadly tuned tPN pathway (VT40053; cell bodies in b). c, This cell type
responds with transients of increasing magnitude to both cooling (top) and
heating stimuli (bottom) of increasing intensity (shown are representative
traces). d–f, A maximum-response image clearly shows that the same neurons
are capable of both cooling (e) and heating (f) responses (ΔT ~ ±5 °C).
g, Stimulus-response plot for defined heating and cooling stimuli (orange dots,
peak response of a single cell to a stimulus, connected dots belong to the
same cell, n = 6 cells/5 animals; grey dots, binned averages ± s.e.m.; bin
width = 2.5 °C, bin at zero = 0 °C ΔT); the responses are proportional to the
stimuli in both the cold and hot range (cold, R² = 0.34, P < 0.001,
slope = −0.20, 95% CI = [−0.31 to −0.09]; hot, R² = 0.5, P < 0.001,
slope = 0.12, 95% CI = [0.07 to 0.17]). Scale bars in b and d, 5 μm.

Figure 5 | Both broadly and narrowly tuned tPNs are required for rapid
temperature avoidance in a two-choice behavioural assay. a, b, The VT40053
(a) and R22C06 (b) drivers are prominently expressed in broadly tuned tPNs, as
shown by GFP expression (cell bodies are circled; arrows indicate mALT;
scale bars, 25 μm). c, d, Silencing this cell type produces flies with significant
defects in both hot and cold avoidance in a two-choice assay. e, Avoidance
indices of control flies bearing the Kir2.1 construct but no driver. f, Silencing
three distinct cold-activated tPNs (expressing VT26020) results in a selective
defect in cold avoidance (scores of the drivers alone are shown in c, d and
f as grey bars; avoidance index values were compared by two-way ANOVA,
asterisks denote a significant interaction between the Gal4 and UAS transgenes,
P < 0.01; error bars indicate mean ± s.e.m.; the number of biological
replicates is in parentheses next to each genotype).


aspects of thermal stimuli to their targets: slow-adapting neurons appear 
better suited to convey the magnitude of a temperature change, while fast- 
adapting cells may better track its temporal structure (onset and offset). 

In addition to ‘narrowly tuned’ cells (that is, responding to either hot 
or cold), we also discovered a significant class of ‘broadly tuned’ tPNs 
(Fig. 4). Each neuron in this group displayed calcium transients in re- 
sponse to both heating and cooling, the peak of which correlated well 
with the stimulus intensity in either direction (Fig. 4c, g). Unexpectedly, 
broadly tuned tPNs represented the largest group of cells we found 
(~10 cells, a significant number when compared to other projection 
neurons; see Extended Data Table 1) and innervated all major targets 
we described (mushroom body, lateral horn and PLP). Therefore, these 
cells are likely to provide significant drive to higher-order thermosen- 
sory regions during any temperature transient. 

We have previously shown that rapid temperature preference is me- 
diated by the opposing pushes of heat- and cold-aversion’. One attract- 
ive hypothesis is that these neurons could represent a common aversive 
behavioural line. If this is the case, silencing their activity should dam- 
pen aversive responses to both hot and cold stimuli. To test this, we 
selected two drivers prominently expressed in these cells for functional 
studies: VT40053 is active in all the broadly tuned tPNs, but its expres- 
sion also includes additional sensory circuits (Fig. 5a). R22C06 is only 
expressed in about half of the broadly tuned cells, but in very few other 
cells in the brain or ventral nerve cord (Fig. 5b, and see Methods for a 
detailed description of these lines). Importantly, with the exception of 
tPNs, these two drivers show no overlap in expression. 

Driving Kir2.1 (ref. 12) (that is, silencing neural activity by hyper- 
polarization) under the control of either line resulted in a significant 
decrease in the avoidance of both cold and hot regimes in a rapid two- 
choice test’ (Fig. 5c, d), demonstrating that this pathway is indeed re- 
quired for rapid temperature preference. In contrast, expression of Kir2.1 
under the control of a driver active in three distinct cold-activated tPNs
also innervating the medial ALT pathway (VT26020, see Extended Data
Fig. 7 and Methods for details) produced an avoidance defect that was 
highly selective for cold temperatures (Fig. 5f), suggesting that both nar- 
rowly and broadly tuned tPNs participate in temperature preference. 
We note that flies in which broadly tuned tPNs were inactivated still 
avoided hot temperatures at the upper limit of their tolerance range 
(>35 °C, Fig. 5), indicating that their locomotor/navigational programs 
are intact, and that the avoidance of high temperatures probably involves 
additional circuit components. 

We define here a circuit diagram for the representation of temper- 
ature in the Drosophila brain (Extended Data Fig. 8). ‘Hubs’ of this cir- 
cuit are the PAL, where the activity of hot and cold glomeruli initially 
signals a temperature change, and the triad of mushroom body, lateral 
horn and PLP, where thermosensory stimuli are richly represented by 
an array of differently tuned projection neurons. Taken together, our 
results demonstrate that substantial processing of thermosensory input 
at a single synaptic step can extract information about the sign, onset, 
magnitude and duration of a temperature change, and establish a frame- 
work to understand how complex temperature responses, often hap- 
pening on different timescales, can be orchestrated starting from a 
simple sensory map. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper.

METHODS


No statistical methods were used to predetermine sample size. 

Experimental animals and transgenes. The following transgenic flies were con-
structed for this study: GR28b.d-LexA, expressing LexA under a promoter element
active in hot receptor neurons (refs 13, 14); IR94a-LexA, from a promoter expressed in both
hot and cold cells (additionally expressed in a small number of olfactory sensory
neurons; ref. 15); Aop-syb:spGFP 1-10, a split-GFP 1-10 fragment targeted to pre-synaptic
sites by fusion to the cytoplasmic domain of synaptobrevin, for GRASP (ref. 8;
L.J.M. and M.G., unpublished; see also ref. 16). Additional transgenic lines: Aop-
CD4:spGFP 1-10, UAS-CD4:spGFP 11 (ref. 9) (a gift from K. Scott); UAS-SPA (ref. 2),
UAS-C3PA (ref. 2), MB-DsRed, syb-Gal4 (all a gift from B. Noro and V. Ruta);
UAS-Kir2.1 (ref. 12), UAS-CD8:GFP, Aop-Tdtomato (ref. 6), Aop-CD2:GFP, UAS-
GCaMP6m (ref. 17), UAS-DenMark (ref. 10), UAS-syt:GFP (ref. 11). Gal4 driver
lines were obtained from the Bloomington Stock centre or VDRC. A full list of 
genotypes is provided as Supplementary Information. 

Two-photon guided conversion of photo-activatable GFP in PAL glomeruli. 
To visualize axonal projections arising from the PAL, we used two-photon guided 
conversion of PA-GFP following the protocol of Ruta et al.’. All imaging was con- 
ducted on a Prairie Ultima two-photon microscopy system equipped with a Coherent 
Chameleon Ti:Sapphire laser, GaAsP PMT, and an Olympus 40X 0.9NA or 60X
1.1NA water immersion objective. In brief, we labelled hot temperature receptor
terminals alone or both hot and cold ones by expression of fluorescent proteins 
(tdTomato (ref. 6) or GFP; under LexA, see above). We dissected 2-4-day-old 
adult flies, and defined 3D ROIs around selected glomeruli while imaging at 945 nm 
(a wavelength that does not activate PA-GFP). Finally, the laser was tuned to 720 nm 
and the volume scanned at low power (10-30 mW, measured at the back aperture 
of the objective) 30 times interleaved by 30 s wait periods. A ~15-min rest allowed 
diffusion of photo-converted PA-GFP. 

Nomenclature of brain regions and 3D reconstruction of axonal pathways. All 
efforts were taken to adhere to the standardized insect brain nomenclature described 
in Ito et al.*. The area defined by the glomeruli formed by hot and cold sensory neu- 
rons (previously referred to as the PAP’) is here referred to as the posterior antennal 
lobe (PAL), as the two regions have been now merged. Indeed, the projections of 
arista neurons were first described by Stocker et al. as targeting the posterior AL'® 
(glomeruli VP2 and VP3). Interestingly, our analysis revealed an additional glomer-
ulus close to the PAL (probably VP1) that seems to be innervated by cold-responding
sacculus neurons (data not shown). However, it was also activated by non-thermal
stimuli and therefore excluded from further analysis. The mAL, mlAL and lAL fibre
tracts are described in Ito et al.*. The transverse 3AL and 5AL tracts have been de- 
scribed in Tanaka et al.’’ and Jenett et al.’, respectively. 3D reconstructions of axonal 
tracts (Fig. 1) were carried out on 3D brain volumes pre-aligned to a standard brain, 
available through BrainBase”’. Semi-automated neurite reconstruction was carried 
out by AMIRA on representative ViennaTile driver lines, and information from 
PA-GFP experiments and additional driver lines used to refine neurite models by 
hand. Automated annotation of the projections of driver lines in FlyLight and/or 
BrainBase directly suggested innervation of target regions (mushroom body, lat- 
eral horn and PLP) based on volume reconstructions. We adopted this nomenclat- 
ure (even when, for example in the case of the lateral horn, tPN innervation appears 
confined to the lateral horn margin), and confirmed the presence of putative pre- 
synaptic terminals by expression of syt-GFP (Extended Data Fig. 3). 3D recon- 
structions of the mushroom body and lateral horn shown in Extended Data Fig. 3 
were carried out in AMIRA based on the nc82/bruchpilot staining of the standard 
brain; because the boundaries of the PLP could not be easily defined in nc82/ 
bruchpilot staining, the PLP was not reconstructed but is shown as an outline.

GRASP, DenMark and syt-GFP visualization and staining. Native GRASP fluo-
rescence was visualized at the synapses between hot, and hot and cold receptor neu-
rons and their potential projection neurons in the PAL using two-photon microscopy
of 2-4-day-old adult dissected fly brains. Immunofluorescence of GRASP was per-
formed on 4% PFA-fixed brains using antibodies that either bind specifically to
reconstituted GRASP (ref. 9; mouse anti-GFP, 1:100; Sigma, catalogue no. G6539) or
preferentially recognize spGFP 10 (chicken anti-GFP, 1:1000; Abcam 13970).
To determine the polarity of tPNs and identify putative pre-synaptic termini, we
expressed under the control of tPN Gal4 lines either the red fluorescent dendritic
marker DenMark (ref. 10) or a synaptotagmin-GFP fusion (ref. 11), respectively, and visualized
native fluorescence by confocal (DenMark) or 2-photon (syt:GFP) microscopy.

Functional calcium imaging of tPN responses to temperature stimuli. Our cal-
cium imaging and temperature stimulation set-up has been previously described in
detail’. In brief, UAS.GCaMP6m flies or intact fly heads were submerged in AHL? 
and sufficient cuticle surrounding the brain was delicately removed to provide optical 
access to the projection neurons. The preparation was then placed in a custom- 
built microfluidics chamber, covered with a plastic coverslip, and placed on the two- 
photon microscope stage. Rapid temperature changes were achieved by controlling 


the temperature of the medium via a custom-built system consisting of a series of 
three-way valves (Lee Instruments, response time 2 ms) and Peltier elements inde-
pendently controlling baseline, ‘hot’ and ‘cold’ flow, respectively. ‘Complex’ stim-
uli (Figs 2 and 3 and Extended Data Fig. 5) were obtained by actuating the control 
valves using a sequence of TTL pulses to alternate between hot flow and cold flow. 
The delay between valve trigger and onset of AT was a function of tubing length, 
1-2 s. Baseline experiments (Extended Data Fig. 6) were done by adjusting the tem- 
perature of baseline flow and allowing the preparation to adapt to it for ~1 min 
before recordings. We used rapid heating and cooling pulses' (~2 °C per second), 
as this regime closely approximates what the fly might encounter during a naviga- 
tional saccade. Temperature was recorded using a BAT-12 electronic thermometer 
(time constant 0.004 s, accuracy 0.01 °C, Physitemp). Images were acquired at 256 ×
256 pixel resolution at a rate of 4 Hz on a Prairie Ultima 2-photon microscope
equipped with a Coherent Chameleon Ti:Sapphire laser, GaAsP PMT, and an Olympus
40× 0.9 NA or 60× 1.1 NA water immersion objective. ΔF/F analysis was carried
out using custom scripts in MATLAB; base fluorescence was calculated using all
frames preceding valve trigger (occurring at 5 s). All recordings were from cell
bodies unless otherwise specified. To extract the ‘onset’ of a given response (Fig. 3e
and Extended Data Fig. 5b, c) we wrote a script that identifies the first sequence of
three data points in the trace, all of which are two standard deviations above average
background (‘noise’, calculated from all data points preceding valve trigger); the
first point in this sequence was used as ‘onset’ and mapped onto the corresponding
time stamp. Temperature was recorded at the fly head during each experiment, and
was synchronized with imaging by starting the temperature recording and image
acquisition with the same trigger signal. As the recording of the two variables was
synchronous (and the distance of the thermo probe from the fly head small and
relatively constant), we generally observed good time-locking between response
and stimulus onset. To calculate decay constants of ‘fast’ and ‘slow’ responses (τ)
we confirmed that the cellular response data fit an exponential decay function, and
measured the time from peak to 1/e (~37%) of peak value.
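
The ΔF/F, onset and decay measurements described above can be illustrated with a short sketch. This is not the authors' MATLAB code, which is not reproduced in the text; it is a minimal Python approximation under stated assumptions: F0 is the mean of the pre-trigger frames, the noise standard deviation is the sample standard deviation of the pre-trigger ΔF/F, the search for the onset starts at the valve trigger, and the decay time is read at the first sample at or below peak/e without interpolation or curve fitting. All function names and the example trace are hypothetical.

```python
import math
from statistics import mean, stdev


def delta_f_over_f(trace, n_baseline):
    """Return dF/F, using the mean of the first n_baseline frames as F0."""
    f0 = mean(trace[:n_baseline])
    return [(f - f0) / f0 for f in trace]


def find_onset(dff, n_baseline, run_length=3, n_sd=2.0):
    """Index of the first sample of the first run of `run_length` consecutive
    samples exceeding baseline mean + n_sd * SD; None if there is no such run.
    The search starts after the baseline window (an assumption)."""
    noise = dff[:n_baseline]
    threshold = mean(noise) + n_sd * stdev(noise)
    run = 0
    for i in range(n_baseline, len(dff)):
        if dff[i] > threshold:
            run += 1
            if run == run_length:
                return i - run_length + 1
        else:
            run = 0
    return None


def decay_time(dff, frame_rate_hz):
    """Time in seconds from the peak to the first later sample at or below
    peak / e (no interpolation); None if the trace never decays that far."""
    peak_idx = max(range(len(dff)), key=dff.__getitem__)
    target = dff[peak_idx] / math.e
    for i in range(peak_idx + 1, len(dff)):
        if dff[i] <= target:
            return (i - peak_idx) / frame_rate_hz
    return None


# Hypothetical trace: 20 baseline frames (5 s at 4 Hz), then a transient.
FRAME_RATE_HZ = 4.0
N_BASELINE = 20
raw = [100, 102] * 10 + [101, 101, 150, 200, 180, 140, 120, 110, 101, 101]
dff = delta_f_over_f(raw, N_BASELINE)
onset = find_onset(dff, N_BASELINE)
print("onset frame:", onset, "onset time (s):", onset / FRAME_RATE_HZ)
print("decay time (s):", decay_time(dff, FRAME_RATE_HZ))
```

At a 4 Hz frame rate the decay time is resolved only to 0.25 s in this sketch; fitting an exponential, as the text mentions for checking the response shape, would give a finer estimate.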

Dose-response statistics. Each point in the dose-response plots (Figs 2d, 3e and
4g) represents the peak response of one cell body to one presentation of a given
temperature change, as in the corresponding representative traces shown (Figs 2b,
3c and 4c). The same cell is challenged with a set of 5-10 stimuli and the resulting
responses are connected by a line (n = 5 cells/5 animals). For each cell type, we
confirmed that the effect of temperature was significant as follows: the bin at zero
contains the ‘responses’ recorded in control experiments ([Ca2+]i maxima of traces
acquired to control for valve artefacts; that is, triggering the valve system without
feeding power to the Peltier elements; ΔT = 0). We performed two-sample t-tests
comparing the zero bin against binned responses to temperature stimuli of ~0.5-3 °C
(for all cell types, P < 0.05). We then calculated regression and slope statistics
on all data points across individual flies to test whether the responses were
proportional to stimulus intensity (see figure legends).
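
The regression step can be sketched as follows. The text does not state which regression model was used, so this example assumes an ordinary least-squares line through all (ΔT, peak response) points and reports slope, intercept and R²; the function name and the data are hypothetical, and confidence intervals and P values are omitted.

```python
def linear_fit(x, y):
    """Ordinary least-squares fit of y = slope * x + intercept.
    Returns (slope, intercept, r_squared)."""
    if len(x) != len(y) or len(x) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    sxx = sum((xi - mean_x) ** 2 for xi in x)
    syy = sum((yi - mean_y) ** 2 for yi in y)
    sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, y))
    if sxx == 0 or syy == 0:
        raise ValueError("x and y must each vary")
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    r_squared = sxy ** 2 / (sxx * syy)
    return slope, intercept, r_squared


# Hypothetical cooling stimuli (degrees C) and peak dF/F responses.
delta_t = [-1.0, -2.0, -3.0, -4.0, -5.0]
peak_response = [0.5, 0.9, 1.4, 1.8, 2.3]
slope, intercept, r_squared = linear_fit(delta_t, peak_response)
print(slope, intercept, r_squared)
```

A negative slope in this convention means that larger cooling steps (more negative ΔT) give larger responses.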

Two-choice behavioural assays for temperature preference. The two-choice assay
for temperature preference has been described before. Briefly, twenty 3-5-day-old flies
were ice-anaesthetized and placed in an arena consisting of four 1″ square,
individually addressable Peltier tiles. In each trial, flies were presented for 3 min with a
choice between 25 °C and a test temperature between 10 °C and 40 °C at 5 °C
intervals, and the position of flies was recorded during each trial (by a BASLER A601FM
camera) to calculate an avoidance index for each test temperature (AI = (no. flies
at 25 °C − no. flies at test temp)/total no. flies). Avoidance index values for each
genotype were compared by two-way ANOVA, and asterisks in Fig. 5 denote a
statistically significant interaction between the Gal4 and UAS transgenes (threshold
P = 0.01, n > 18 biological replicates, see figure for specific n values). Kolmogorov-
Smirnov tests were used to confirm a normally distributed sample. Homogeneity
of variance for each data set was confirmed by calculating the Spearman correlation
(ρ) between the absolute values of the residual errors and the observed values
of the dependent variable (threshold P = 0.05). Statistical analysis was carried out
in MATLAB; sample sizes were chosen to reliably measure experimental parameters.
Experiments did not involve randomization or blinding. The driver VT40053 is
expressed in nearly all the broadly tuned tPNs that we have observed (~10) but its
expression also includes one cold-activated tPN, olfactory projection neurons,
fibres targeting auditory centres, as well as sensory projections from the taste
system (see http://brainbase.imp.ac.at/). In contrast, R22C06 is selectively expressed
in a small group of projection neurons projecting to the antennal lobe: a subset of 
the broadly tuned tPNs (5-6), 1-2 cold-activated tPNs and 1-2 olfactory projection 
neurons, but no other cell in the brain (and only two neurons in the ventral nerve 
cord). VT26020 is expressed in three cold-activated tPNs innervating the mALT 
whose axons do not descend to the PLP (see Extended Data Figs 2d, e and 6). 
Expression is also prominent in taste and auditory centres. 
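
As a worked illustration of the avoidance index defined in the two-choice assay above, AI = (no. flies at 25 °C − no. flies at test temp)/total no. flies, the following minimal Python sketch computes it from fly counts. The function name and the counts are hypothetical, and the sketch assumes that every scored fly sits on one of the two sides, so that the total is the sum of the two counts.

```python
def avoidance_index(n_at_25c, n_at_test):
    """AI = (flies at 25 C - flies at test temperature) / total flies.
    +1 means all flies avoid the test temperature, -1 means all prefer it."""
    total = n_at_25c + n_at_test
    if total == 0:
        raise ValueError("no flies were scored")
    return (n_at_25c - n_at_test) / total


# Hypothetical trial with 20 flies: 15 on the 25 C side, 5 on the test side.
print(avoidance_index(15, 5))
```

An index near 0 indicates no preference between the two temperatures.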
Extended Data Figure 1 | Targeted photo-conversion of PA-GFP reveals
candidate tPN pathways. a, Diagram of the fly brain showing the location of 
the PAL. b, Schematic diagram showing the expected location of ‘cold’ and 
‘hot’ glomeruli (blue and red, respectively). c-e, The hot glomerulus is 
visualized by tdTomato, driven by Gr28b.d-LexA (red). e, Targeted 
photo-activation of pan-neuronal PA-GFP (syb-Gal4/UAS-SPA) reveals 
ascending pathways (white arrowheads). Panels d and e are imaged under 
the same conditions, before (d) and after (e) photoconversion. MB, mushroom 
body; PAL, posterior antennal lobe; SEZ, suboesophageal zone; AL, antennal
lobe; AN, antennal nerve; SPP, super peduncular protocerebrum.
Scale bars, 25 μm.
Extended Data Figure 2 | Synaptic-targeted GFP reconstitution across
synaptic partners (GRASP) reveals specific connectivity patterns
between antennal temperature receptors and tPN classes. Using an
n-synaptobrevin fragment to target spGFP1-10 to the pre-synaptic membrane
(see Methods) increases the specificity of GRASP fluorescence at synapses
between thermosensory neurons and projection neurons in the PAL.
a, Immunofluorescence staining of GRASP contacts between hot and cold
receptor neurons (IR94a-LexA, Aop-CD4:spGFP1-10) and projection neurons
(VT46265, UAS-CD4:spGFP11) shows diffuse expression of spGFP1-10
throughout the membranes of the hot and cold receptor neurons (red), in
addition to the GRASP signal at the synapses between hot cells and VT46265
projection neurons (green). Some non-synaptic GRASP fluorescence is seen
at sites of contact between the receptor neurons and fibres of passage (arrow in
a, and see cartoon below for illustration). In contrast, synaptically targeted
syb-GRASP (or syb:GRASP) (b) shows punctate expression of syb-spGFP1-10
(red) and specific GRASP signal (green) only at the hot-cell synapses of the
PAL. c, Representative native GRASP fluorescence between either (d, f, h)
hot and cold temperature receptors (IR94a-LexA; Aop-CD4:spGFP1-10) or
hot only temperature receptors (e, g, i, GR28b.d-LexA; Aop-syb-spGFP1-10)
onto three projection neuron types (VT40053, R60H12 and VT46265, each
driving UAS-CD4:spGFP11). GRASP connectivity corresponds well to
their tuning (see text and Extended Data Table 1 for details). Scale bars in
d-i, 25 μm.
Extended Data Figure 3 | Target regions of tPN cell types. Transgenic
expression of synaptotagmin-GFP reveals the location of putative pre-synaptic 
termini for each of the identified tPN cell types. a, Three-dimensional 
reconstruction of key target regions (the mushroom body and lateral horn, 
based on nc82 staining) showing their relationship to the tPN pathways 
(colour and abbreviations are as in Fig. 1; note that the boundaries of the PLP
are not clearly demarcated by nc82 staining, hence the PLP was not
reconstructed but outlined; see Methods for details). b, d, f, h, j, l, Three-dimensional
reconstructions of specific tPN pathways; c, e, g, i, k, m, two-photon
stacks of brains expressing synaptotagmin-GFP under the control of Gal4
drivers expressed in neurons innervating such pathways (the driver name
appears at the bottom of each panel). GFP fluorescence identifies putative
pre-synaptic structures in the target regions (arrowheads). In c, e and i, the
mushroom body calyx is circled. Scale bars, 25 μm.
Extended Data Figure 4 | Temporal dynamics of slow- and fast-adapting
cold-activated tPNs. Calcium transients of slow- and fast-adapting cold- 
activated tPNs differ significantly in the speed of decay. a, The response of a 
fast tPN (right plot, grey trace) returns to baseline much more rapidly than that 
of a slow one (left plot, grey trace). In a, a small cooling stimulus and the 
response it elicited in a representative cell are normalized and overlaid to 
facilitate comparison of dynamics (note that the sign of cooling stimuli is
inverted for clarity, ΔT ~ −3 °C, responses are ΔF/F recorded at cell bodies,
stimuli and responses are recorded synchronously; see Methods for details).
The decay constant of each response is shown next to each plot (τ, defined as
the time from peak to 1/e, ~37%, of peak value). b, Decay constants of slow and
fast cold tPN responses to a range of cold stimuli are systematically different
(box-plot: red line, median; blue box, 25th and 75th percentiles; whiskers,
data range; P < 0.001, t-test; ΔT = 2.91 ± 0.75, mean ± s.d.; n = 16 stimuli,
n = 7 slow cells, 9 fast cells). In c, a larger cooling stimulus and the response it
elicited in a representative slow-adapting and fast-adapting cell are shown
side-by-side to facilitate comparison. Note that the slow-adapting cell displays
a persistent response that lasts as long as the cooling stimulus. In contrast,
as a consequence of faster decay, the response of a ‘fast-adapting’ cell returns to
the pre-stimulus baseline when the temperature is still changing.
Extended Data Figure 5 | ON and OFF responses amidst a complex
temperature stimulus. ‘Fast-adapting cold’ (purple trace) and ‘fast-adapting 
hot’ cells (green trace) track remarkably well a dynamic temperature transient, 
displaying alternating ON and OFF responses. The cold cell type (purple 
trace) displays a significantly delayed response whenever a cooling phase is 
preceded by heating. Notably, cooling responses following a heating step begin 
when the temperature has nearly returned to baseline (red arrowheads referred 
to as OFF responses; see main text). Shown here is a representative single 
stimulus and corresponding response traces simultaneously recorded from 
adjacent cell bodies; genotype R95C02/UAS-GCaMP6m. 
Extended Data Figure 6 | ON and OFF responses of fast tPNs are not
limited to a defined range of absolute temperature. To better understand the 
nature of ON and OFF responses, we tested whether the dynamic activity of 
fast-adapting cells was limited to a defined range of absolute temperatures 
by delivering defined heating and cooling stimuli from a range of baseline 
temperatures. a, ON responses from cold (purple traces) and hot (green traces) 
cells recorded from a 25 °C baseline and from a higher (29 °C) or lower
(20 °C) baseline, respectively (shown are representative single stimuli and
corresponding response traces recorded from cell bodies). b, c, The activity of
fast tPNs is not restricted to a specific temperature band as evident by plotting
the absolute temperature corresponding to response onset (see arrows in
a) against the baseline temperature for fast-adapting cold (b) and fast-adapting
hot (c) cells, challenged with a range of hot and cold stimuli (ΔT 2-7 °C;
cooling ΔT = −4.7 ± 1.9 °C, heating ΔT = 4.6 ± 2 °C, mean ± s.d.; n > 10
cells/>10 animals per cell type). In b, c, filled purple and green dots represent
ON responses while empty circles represent OFF responses. Owing to their
specific delay, the OFF responses of cold cells start at the very end of a hot
stimulus (that is, when the temperature has nearly returned to baseline:
Tonset − Tbaseline = 0.68 ± 0.43 °C, mean ± s.d.); in contrast, hot cells respond
at or near the start of the heating phase that follows a cold stimulus (see below)
and, as a result, systematically at a lower temperature than baseline (Tonset −
Tbaseline = −2.97 ± 1.15 °C, mean ± s.d.; note that this value is significantly
different than the one calculated for cold cells, P < 0.001, t-test). d-h, Fast-adapting
hot cells innervating the lateral pathway display OFF responses whose
delay varies by cell. d, e, A pair of adjacent hot-activated (green arrow and 
trace) and cold-activated (purple arrow and trace) tPNs innervating the lateral 
pathway (3D reconstruction in d, cell bodies in e; same cells and colour 
coding as in main Fig. 3). f, The cold-activated tPN responded to cooling 
stimuli with a rapid ON transient. The hot-activated cell in this pair displayed a 
calcium transient that was not delayed; that is, it started immediately with
the heating phase that followed the cooling stimulus (light green trace). g, h, In
contrast, a different fast hot cell type in the same cluster (dark green circle
and trace) displayed a delayed response (arrowhead), similar to the delayed
OFF response reported for cold-activated tPNs at the end of a heating
stimulus (see Fig. 3). Scale bars in e and g, 5 μm.
Extended Data Figure 7 | Cold-specific tPNs innervating the mALT. a-d, A
distinct group of cold-activated tPNs follows the mALT and innervates the 
mushroom body and lateral horn (represented by a 3D reconstruction). b, The 
driver VT26020 is active in three neurons innervating this pathway (also 
showing prominent expression in auditory centres, AU, see Methods for 
details; PAL, posterior antennal lobe, the approximate position of the
cold glomerulus is circled; genotype = VT26020-Gal4/UAS-CD8:GFP).
c, Representative calcium-imaging traces from a single cell body displaying
robust transients in response to cooling stimuli of increasing magnitude
(note that these cells do not show activation in response to heating).
d, Stimulus-response plot showing the responses of this cell type to
defined cooling stimuli recorded at the cell body. The responses are
proportional to cold stimuli over a wide range of intensities (R² = 0.8,
P < 0.001; slope = −0.45, 95% CI = [−0.57 to −0.33]; regression and slope
statistics were calculated on all data points across individual flies). Each blue
dot represents the maximal response of a single cell to a defined stimulus; dots
are connected when the responses belong to the same cell, n = 3 cells/3
animals. Grey dots represent binned averages ± s.e.m. (bins = 2 °C;
genotype: VT26020-Gal4/UAS-GCaMP6m). Scale bar in b, 25 μm.
Extended Data Figure 8 | tPN pathways mediating temperature
representation in the fly brain. Summary of the array of tPN pathways
mediating temperature representation in the fly brain, their properties and
target regions. Multiple pathways carry specific information about stimulus
quality (hot and cold) to the mushroom body, lateral horn and PLP. Most
tPNs innervating these pathways are narrowly tuned, responding to either hot
or cold stimuli, but a prominent group of cells is capable of responding to
both heating and cooling temperature stimuli (broadly tuned). Moreover,
response dynamics of tPNs are strikingly different, allowing a classification of
tPN responses in slow versus fast adapting.
Extended Data Table 1 | Driver lines used in this study and summary of the properties of the tPNs in which they are active

Pathway | Driver | Dendrites (DenMark) | Terminals (syt:GFP) | GRASP | Speed (τ, AV ± SD; s)
mALT | R22C06 | PAL | MB, LH, PLP | Hot and cold |
 | | | | | Slow (8.5 ± 4.6)
 | VT40053 | PAL | MB, LH, PLP | Hot and cold |
 | VT26020 | PAL | MB, LH | Cold | Slow (4.4 ± 1.1)
lALT
Posterior | R95C02 | PAL | LH, PLP | Hot and cold |
 | | | | | Fast (1.52 ± 0.5)
 | VT19428 | PAL | LH, PLP | Cold |
Anterior | VT46265 | PAL | LH, PLP | Hot |
 | | | | | Fast (1.5 ± 0.6)
 | VT60737 | PAL | LH, PLP | N.D. |
Eel. | R84E08 | PAL | LH, PLP | Hot and cold | Slow (4 ± 0.7)*
Pee | R60H12 | PAL | MB | Cold |
 | | | | | Slow (6.9 ± 1.7)
 | R30B06 | N.D. | MB | N.D. |
[Narrowly tuned and Broadly tuned columns: coloured spots, not recoverable from the extracted text.]
Driver lines expressed in specific tPN cell types organized by pathway. Their nomenclature reflects their origin from either the Janelia Farm FlyLight initiative (R lines) or the ViennaTile project (VT lines). The
localization of dendritic arborizations in the PAL was defined by transgenic expression of the dendritic marker DenMark, while putative pre-synaptic terminals in target regions were visualized by synaptotagmin-GFP
expression (MB, mushroom bodies; LH, lateral horn; PLP, posterior lateral protocerebrum). The GRASP column summarizes the result of experiments performed with hot- or hot-and-cold LexA drivers (see
Extended Data Fig. 2 for details). Hot or cold signifies that GRASP reconstitution was limited to the hot or cold glomerulus. ‘Hot and cold’ denotes broad GFP reconstitution in the PAL. Coloured spots denote the
number of tPNs labelled by each driver and their tuning, such that a blue spot represents a single cell responding to cooling while a red spot represents a cell responding to heating. Purple spots are broadly tuned
projection neurons. For tuning, cells are defined as hot- or cold-activated if they reliably respond to heating or cooling with average calcium increases of at least 50% ΔF/F at the onset of a rapid ~2-4 °C ΔT stimulus
(min n > 3 animals/5 cells) and with no significant calcium increases above noise in response to the opposite stimulus (min n > 3 animals/5 cells). In all cases, we confirmed that the effect of temperature was
significant (see Methods). ‘Speed’ refers to the decay kinetics of their responses, and is shown as τ (see text and Methods for details). Note that a number of drivers are active in tPNs with different tuning properties
but in no cases in both ‘fast-adapting’ and ‘slow-adapting’ cells; for this reason a single τ is shown for each driver, corresponding to responses from a variety of stimuli (n > 10). *The R84E08 driver supports very low
expression of GCaMP6m. Responses from these cells were recorded at the PAL and appear to be slow (τ ~ 4 s) but a better driver will be needed to describe in detail the response properties and dynamics of these
tPNs.
CetZ tubulin-like proteins control archaeal cell shape


Iain G. Duggin, Christopher H. S. Aylett, James C. Walsh, Katharine A. Michie, Qing Wang, Lynne Turnbull,
Emma M. Dawson, Elizabeth J. Harry, Cynthia B. Whitchurch, Linda A. Amos & Jan Löwe


Tubulin is a major component of the eukaryotic cytoskeleton, con- 
trolling cell shape, structure and dynamics, whereas its bacterial 
homologue FtsZ establishes the cytokinetic ring that constricts during
cell division. How such different roles of tubulin and FtsZ
evolved is unknown. Studying Archaea may provide clues as these
organisms share characteristics with Eukarya and Bacteria. Here
we report the structure and function of proteins from a distinct 
family related to tubulin and FtsZ, named CetZ, which co-exists with 
FtsZ in many archaea. CetZ X-ray crystal structures showed the 
FtsZ/tubulin superfamily fold, and one crystal form contained sheets 
of protofilaments, suggesting a structural role. However, inactiva- 
tion of CetZ proteins in Haloferax volcanii did not affect cell divi- 
sion. Instead, CetZ1 was required for differentiation of the irregular 
plate-shaped cells into a rod-shaped cell type that was essential for 
normal swimming motility. CetZ1 formed dynamic cytoskeletal 
structures in vivo, relating to its capacity to remodel the cell envelope 
and direct rod formation. CetZ2 was also implicated in H. volcanii
cell shape control. Our findings expand the known roles of the FtsZ/tubulin
superfamily to include archaeal cell shape dynamics, suggesting
that a cytoskeletal role might predate eukaryotic cell evolution,
and they support the premise that a major function of the
microbial rod shape is to facilitate swimming.

Many archaea produce FtsZ that appears to function in cell division.
However, unlike bacteria, archaeal genomes frequently contain additional
genes belonging to the FtsZ/tubulin superfamily. These genes
are abundant in the haloarchaea, which dominate hyper-saline lakes
globally and are generally noted for their unusual flattened cell
morphologies. Examples include rectangular prisms, triangles, and
pleomorphic cells seen in the model organism H. volcanii. The molecular
basis of cell shape in haloarchaea is unknown.

Figure 1 | CetZ proteins comprise a distinct tubulin/FtsZ family.
a, Phylogram of the tubulin superfamily across the domains of cellular life. The
major taxa containing identified CetZ family members are shown. Bootstrap
support is shown for selected branches. H. volcanii CetZ1-6 (Hv1-6) and other
CetZ sequences cluster into orthologous groups. Mth CetZ belongs to a
non-canonical group that branches from near the centre of the tree. Two archaeal
tubulins (Artubulins) identified in Thaumarchaeota (ref. 16) are indicated
within the tubulin branch. b, Amino acid sequence alignments of some
conserved regions. Conservation is indicated by red shading (universal), yellow
(primarily shared in FtsZ and CetZ), blue (CetZ and tubulin), or green (CetZ).
T7 residues mutated to alanine in FtsZ1 (D250) and CetZ1 (E218) are
underlined. c, Structural comparison of FtsZ (PDB 1FSZ), Mth CetZ, and
tubulin (PDB 1JFF). d, H. volcanii CetZ2-GTP-γS crystal structure shows a
straight and untwisted protofilament. The C-terminal α-helix (H11, cyan)
superimposes well with the equivalent helix in straight tubulin filaments,
although it differs from the disordered C terminus consistently observed in FtsZ.
Secondary structural helices (H), sheets (S) and turns (T) are numbered
sequentially. Species abbreviations: Mth, Methanosaeta thermophila; Hv,
Haloferax volcanii; Ca, Cyanobacterium aponinum; Ec, Escherichia coli; Bs,
Bacillus subtilis; Mtb, Mycobacterium tuberculosis; Hp, Helicobacter pylori; At,
Arabidopsis thaliana; Sc, Saccharomyces cerevisiae; Hs, Homo sapiens.

The largest group of archaeal FtsZ/tubulin proteins clearly distinguishable
from the FtsZ and tubulin families is a deeply-rooted family
from the phylum Euryarchaeota that we named CetZ (formerly annotated
‘FtsZ3’, or ‘FtsZ type 2’), after the prototypical member described here:
‘cell-structure-related euryarchaeota tubulin/FtsZ homologue 1’ (CetZ1, 
Fig. 1a). Multiple sequence alignments showed that CetZ family mem- 
bers contain a mosaic of tubulin-like and FtsZ-like amino acid residues; 
several of the most conserved residues involved in GTP binding and 
hydrolysis are identical in CetZ and tubulin (Fig. 1b, blue shading). In 
some other regions there is stronger sequence similarity between CetZ 
family members and FtsZ family members, leading to a closer apparent 
relationship of these two families (Fig. 1a, b).

To further investigate these relationships, we solved the crystal struc- 
tures of H. volcanii CetZ1 (HVO_2204) and CetZ2 (HVO_0745), from 
separate orthologous groups, and a divergent CetZ from Methanosaeta 
thermophila (Extended Data Table 1). The structures showed the FtsZ/ 
tubulin core fold, with a carboxy-terminal helical extension (H11) like 
tubulin, but without the amino-terminal helical extension seen in FtsZ 
proteins (Fig. 1c). CetZ2 crystallized as a protofilament with GTP-γS
that showed subunit interaction geometry, spacing (43 Å subunit repeat)
and surface area (941 Å²) consistent with archetypal tubulin and FtsZ
protofilaments (Fig. 1d). Furthermore, the protofilaments were arranged
as 2D sheets, showing lateral interactions between protofilaments consistent
with those of FtsZ and tubulin (Extended Data Fig. 1). These CetZ
structures suggest a possible cytoskeletal or cytomotive function.

To search for functions of the CetZ family, we individually deleted
the six cetZ open reading frames identified in the H. volcanii genome
(ΔcetZ1; Fig. 2a and Extended Data Table 2). No differences in growth
rate or cell size were detected in the six ΔcetZ strains compared to the
parent strain (H98), indicating that CetZ proteins are not individually
required for cell division (Fig. 2b). To investigate possible functional
redundancy amongst the multiple CetZ family members, we designed
a dominant-inhibitory point mutation in CetZ1, the most conserved
of the H. volcanii CetZ family (close orthologues of CetZ1 were found
throughout the class Halobacteria; Extended Data Fig. 2). This mutation
(CetZ1(E218A), Fig. 1b) was based on well characterized mutants
of tubulin and FtsZ that block GTPase-dependent filament disassembly,
forming hyper-stable filaments that severely disrupt function.
Expression of the equivalent mutant of H. volcanii FtsZ1 (D250A,
Fig. 1b) resulted in a very heterogeneous cell-size distribution, including
many extensively overgrown cells, the hallmark of a major cell
division defect (Fig. 2c, d). Conversely, no division defect was detected
during CetZ1(E218A) expression in H. volcanii ΔcetZ1 (Fig. 2c), despite
the strong dominant-inhibitory behaviour of this mutant in other
functions described below. Therefore, CetZ1 is not required for cell
division, whereas FtsZ1 plays an important role.

Motility assays revealed a swimming defect in H. volcanii ΔcetZ1; no
substantial migration in soft agar medium was observed after 5 days
of incubation, whereas normal expanding halos of motile cells were
observed in strains carrying single deletions of the other five cetZ genes
(Fig. 2e). Extended incubation showed that H. volcanii ΔcetZ1 was
motile, with a significantly reduced rate compared to the wild type
(Fig. 2f). Both strains also showed exclusion zones between adjacent
halos (Fig. 2f and Extended Data Fig. 3a, b), indicating repulsive chemotaxis
or sibling colony inhibition. These results suggest that the
ΔcetZ1 mutation primarily affects swimming speed.

To confirm a role for CetZ1 in motility, CetZ1(E218A) was expressed
under control of the tryptophanase gene promoter on the plasmid pTA962
(ref. 21) in an otherwise wild-type cetZ strain (H98). By using increasing
tryptophan concentrations, chosen to give dose-responsive expression,
we observed a corresponding reduction in motility compared to the control
(Fig. 2g). This dominant-inhibitory effect of CetZ1(E218A) confirms
that CetZ1 has an important role in motility.

In Hv-Ca liquid medium, H. volcanii exhibited the irregular-plate
(or ‘disc-shaped’) morphology (Fig. 2d). However, we discovered that
cells withdrawn from the leading edge of Hv-Ca soft-agar motile halos
were rod-shaped (Fig. 2h, see also Supplementary Video 1). The frequency
of rod cells decreased markedly towards the centre of the halo, where
very few rods were seen (Extended Data Fig. 3d-f). In contrast, samples
from the low-motility ΔcetZ1 halos were devoid of rods (Fig. 2h).
Furthermore, moderate expression of CetZ1(E218A), which reduced
motility compared to wild-type CetZ1 (Fig. 2g, 0.2 mM Trp), produced
much more rounded cells at the leading edge of the halo (Fig. 2h). CetZ1
is therefore essential for the development of a rod-shaped cell type
required for efficient swimming.

Figure 2 | H. volcanii CetZ1 is essential for efficient swimming and rod-cell
development, but not cell division. a, Western blot detection of CetZ1 in
mid-log- and stationary-phase cultures (in the rich medium, Hv-YPC + 1 mM
Trp). Ponceau S pre-staining was used as a loading control (bottom).
CetZ1(E218A) (in strain ID59+pTA962-cetZ1(E218A)) builds up strongly
in non-dividing (stationary phase) cells compared to wild-type CetZ1
(ID59+pTA962-cetZ1), which shows net depletion in the stationary phase,
consistent with the notion that CetZ1(E218A) polymer hyper-stability provides
increased protection from proteolysis. b, Cell volume histograms for the six
H98 ΔcetZ strains during mid-log growth in Hv-Ca medium. c, Cell volume
histograms (log-spaced bins) from cell populations producing CetZ1(E218A)
(ID59+pTA962-cetZ1(E218A), mid-log growth in Hv-Ca + 1 mM Trp), or
FtsZ1(D250A) (H98+pTA962-ftsZ1(D250A), 20 h and 40 h after addition of
1 mM Trp). d, Phase-contrast images of wild-type (H98+pTA962), ΔcetZ1
(ID59+pTA962) and FtsZ1(D250A) (H98+pTA962-ftsZ1(D250A), 40 h Trp)
strains. e, Motility assays of H98 ΔcetZ strains (Hv-Ca + 0.3% agar, day 5).
f, Time-course motility assays using an alternative genetic background (H53),
with four spaced inoculations per plate. The same results were obtained with
the H98 background (not shown). g, Motility assays (day 5) of wild-type
(H98+pTA962), CetZ1 (H98+pTA962-cetZ1) and CetZ1(E218A)
(H98+pTA962-cetZ1(E218A)) strains with the indicated Trp concentrations.
High concentrations of Trp partially inhibit growth and motility in all strains.
h, Cells withdrawn from within 2 mm of the halo edge were viewed by
phase-contrast microscopy; cell circularity (right) was measured by automated
analysis of the images. Leading-edge wild-type cells had a mean width of
0.720 ± 0.013 μm (95% CI), s.d. = 0.094 μm, indicating specific regulation of
rod-cell width. Frequency in b, c and h is given in arbitrary units.

To determine whether CetZ1 directly controls cell-shape changes, wild- 
type and mutant proteins were overproduced using a plasmid during 
growth in liquid Hv-Ca. Overproduction of CetZ1 (by two- to threefold) 
caused a rod morphology in a substantial fraction of cells compared to 
the control (Fig. 3a, b). During CetZ1(E218A) expression, rods were 
not detected, and the cells instead formed rather jagged, blocky shapes, 
exhibiting short stalks and other protrusions (Fig. 3a, c). For both wild- 
type and CetZ1(E218A) proteins, the extent of the phenotypes was sen- 
sitive to the level of production (Extended Data Fig. 4). CetZ1 therefore 
plays a pivotal role in controlling cell shape. 

To examine cell shape and protein subcellular localization, cells pro- 
ducing additional FtsZ1 or CetZ1 tagged with green fluorescent protein 
(GFP) were visualized. FtsZ1-GFP showed mid-cell bands (rings), which 
contracted with division furrows, in plates and motile rods (Fig. 4a, b). 
In plates, CetZ1-GFP was seen throughout the cytoplasm, and loca- 
lized as spots, short filaments (or patches) of variable intensity at or 
near the envelope, particularly in regions of relatively high curvature 
at the cell edges, and at mid-cell and the division furrow (Fig. 4c and 
Supplementary Video 2). In motile rods, CetZ1-GFP localized at or 
near one or both poles, as spots, short filaments or end-caps, and at mid- 
cell in some (Fig. 4d). These patterns were recapitulated with mono- 
meric GFP(A206K) and mCherry tags (Extended Data Fig. 5). 

Similar to previous FtsZ and tubulin GFP-fusion proteins, CetZ1-GFP
was not fully functional, as its overproduction did not cause rod
morphology like untagged CetZ1 (Extended Data Fig. 6a, b). However, 
overproduced CetZ1-GFP formed dynamic filaments in some cells, 
typical of the tubulin family (Supplementary Video 3). Furthermore, 
moderate expression of CetZ1(E218A)-GFP inhibited rod formation 
(Extended Data Fig. 6d-f), indicating that the fusion protein interacts 
with native CetZ1. 

Cells producing CetZ1(E218A)-GFP showed very low cytoplasmic 
fluorescence and intense patches of localization at the cell envelope, 
predominantly at the regions of high curvature caused by the E218A 
mutation, such as V-grooves (inward), corners and stalks (Fig. 4e, Ex- 
tended Data Fig. 6a, g, h and Supplementary Video 4). Consistent with 
Figure 3 | CetZ1 overproduction changes H. volcanii cell shape. Mid-log 
cultures overproducing the indicated proteins (in Hv-Ca + 2 mM Trp) were 
sampled for phase-contrast microscopy. a, Images of wild-type vector-only 
(H98+pTA962), CetZ1 overproduction (H98+pTA962-cetZ1) and 
CetZ1(E218A) overproduction (H98+pTA962-cetZ1(E218A)) strains. (Note: 
H. volcanii cells lie on their larger flat surfaces.) b, c, Frequency distributions 
were obtained for the three strains from automated analysis of cell 
circularity (b), and relative standard deviation (RSD) of curvature (c). The 
symbols for data apply to both b and c. Frequency in b and c is given in 
arbitrary units. 
Figure 4 | Subcellular localization of FtsZ1-GFP and CetZ1-GFP in H. 
volcanii. a, Fluorescence (green) and phase-contrast (grey) image overlay 
showing cells producing FtsZ1-GFP (H98+ pIDJL40-ftsZ1) sampled during 
mid-log growth in Hv-Ca + 0.2 mM Trp. b, Cells from a motile halo edge 
(0.2 mM Trp) displaying FtsZ1-GFP, and a cell (inset) showing evidence of 
division constriction. c, CetZ1-GFP in mid-log cells (Hv-Ca + 0.2 mM Trp) 
visualized by regular fluorescence and three-dimensional structured- 
illumination microscopy (3D-SIM; xy views of 3D-rendered stacks; all 3D-SIM 
scale bars, 1 µm). 3D-SIM z-series showed that the localizations are envelope 
associated (Supplementary Video 2). d, CetZ1-GFP fluorescence in cells 
from a motile halo edge (0.2 mM Trp), showing detail of an end-capped cell 
(inset), and 3D-SIM reconstructions (right) showing single or multiple foci, 
bi-lobed structures, or short filaments. e, Phase contrast/fluorescence and 
3D-SIM of CetZ1(E218A)-GFP during mid-log growth in Hv-Ca + 2 mM Trp. 
An example of a cell protrusion and corresponding CetZ1-GFP localization are 
indicated by the arrowhead. f, Ten-nanometre sections from electron 
cryotomography 3D reconstructions, showing detail of the cell envelope in 
wild-type (H98+pTA962) and CetZ1(E218A) cells (H98+pTA962- 
cetZ1(E218A)), grown in Hv-YPC + 4 mM Trp. Assumed layers of the 
envelope are indicated, and an extra layer of density on the inner surface of the 
envelope, which was only clear during CetZ1(E218A) overproduction, is 
indicated by the arrowhead; 3D segmentation analysis showed that the 
indicated layer occupies a surface area of 75 nm × 300 nm. S-layer, surface 
layer. g, Time-lapse imaging of CetZ1-GFP during rod development in Hv- 
YPC + 0.2 mM Trp (15-min intervals). See also Supplementary Video 8. 


this, electron cryotomography sections of cells producing high levels 
of untagged CetZ1(E218A) showed regions with an additional layer 
on the inner surface of the envelope (Fig. 4f, Extended Data Fig. 7 and 
Supplementary Video 5). Time-lapse imaging revealed that CetZ1-GFP 
localizations were highly dynamic in growing cells, moving in a time 
frame of seconds to minutes, whereas the bright CetZ1(E218A)-GFP 
localizations were hyper-stable (Supplementary Videos 6 and 7). The 
synthetic effects of CetZ1(E218A) on cell structure, and its correspond- 
ing localization, indicate that it may directly remodel the cell envelope. 

Rod development occurred by a transition from a two-dimensional 
(expansive) growth of plates to a one-dimensional mode in rods, with a 
concomitant narrowing of cells also clear in some cases; filaments con- 
taining CetZ1-GFP were sometimes seen moving along the long axis, 
particularly at the edges near mid-cell as the rod elongated (Fig. 4g and 
Supplementary Videos 8, 9 and 10), suggesting that such filaments rep- 
resent an active form of CetZ1 in rod development. 

Altogether, our results show that CetZ1 is a key part of a dynamic 
cytoskeleton that reshapes the cell during rod development. The struc- 
tures and functions of H. volcanii CetZ1 and CetZ2, which appears to 
participate in cell-shape control (Extended Data Fig. 8), together with 
the appearance of homologues in diverse Euryarchaeota (Fig. 1a and 
Extended Data Fig. 2), suggest that dynamic regulation of cell shape by 
FtsZ/tubulin-superfamily proteins may pre-date the emergence of eu- 
karyotic cells. 

The tubulin-like active-site residues and helix 11 in CetZ family mem- 
bers (Fig. 1) could reflect an early evolutionary divergence of FtsZ and 
CetZ/tubulin, corresponding to the establishment of a non-cell-division 
role, followed by a further split of CetZ and tubulin as cell architecture 
evolved in the Archaea/Eukarya lineage. On the other hand, the slightly 
higher overall sequence similarities between FtsZ and CetZ would argue 
that the tubulin-like features of CetZ arose independently of tubulin for 
similar functional purposes. 

The development of the rod-shape to attain normal swimming speed 
indicates that this is an adaptation to optimize swimming, as generally 
predicted from theoretical considerations of microbial shapes. 
Interconversion of plate and rod cell types would therefore serve as a stop/go 
switch that responds to the local environment. We anticipate that H. 
volcanii differentiation and the CetZ family will serve as useful models 
for cell biology and cytoskeletal evolution. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS 

Molecular phylogeny. Protein sequences were retrieved from the National Center 
for Biotechnology Information (NCBI) website. PSI-BLAST was used to identify 
CetZ sequences using CetZ1 (HVO_2204) from H. volcanii as the query sequence. 
Protein sequences were aligned using MUSCLE, and then a phylogenetic tree was 
generated using FastTree. 

Protein purification. The open reading frames for H. volcanii CetZ1 (HVO_2204) 
and CetZ2 (HVO_0745) were first amplified from genomic DNA and then cloned 
into the expression plasmid pHis17 (ref. 31). pHis17-HvCetZ1 encoded the com- 
plete sequence of H. volcanii CetZ1 without additional residues, whereas pHis17- 
HvCetZ2 encoded the complete sequence of H. volcanii CetZ2, adding six histidine 
residues at the C terminus. The gene encoding M. thermophila CetZ (Mthe_0239, 
UniProt A0B5R2) was synthesized codon-optimized (Genscript), and cloned into 
pET28a (pET28a-MtCetZ), including an additional six histidine residues at the C 
terminus. 

Escherichia coli C41 strains harbouring the expression vectors were grown at 
37 °C in 12 litres of 2× YT broth supplemented with 100 µg ml⁻¹ ampicillin or 
50 µg ml⁻¹ kanamycin. Isopropyl-β-D-1-thiogalactopyranoside was added 
to a final concentration of 1 mM (at D600 = 1.0), and incubation continued for 18 h 
at 20 °C. Collected cells were resuspended in 20 ml per litre of bacterial culture of 
100 mM Na-HEPES pH 7.0, and then lysed using a cell disruption system (Constant 
Systems) running at 25 kPSI at 4 °C. The mixture was centrifuged at 45,000g for 
30 min at 4 °C. H. volcanii CetZ2 and M. thermophila CetZ were purified from the 
supernatant by immobilized metal affinity chromatography (IMAC), using a 5 ml 
HisTrap HP column (GE Healthcare) running in 100 mM Tris-Cl (pH 8.5), 500 mM 
KCl, with a 0-1 M imidazole gradient, followed by ion exchange with a HiTrap Q HP 
column (GE Healthcare) running in 25 mM Tris-Cl pH 8.5, with a 0-500 mM KCl 
gradient. Size-exclusion chromatography was then carried out using a HiLoad Sephacryl 
S200 16/60 column (GE Healthcare), running in 25 mM Tris-Cl pH 8.5, 200 mM 
KCl, 1 mM EDTA, 1 mM NaN3. High-yielding fractions were pooled and concentrated 
to 25 mg ml⁻¹ before and after size exclusion chromatography in 20 ml 
capacity, 30 kDa cut-off centrifugal concentrators (Vivaspin). Untagged H. volcanii 
CetZ1 was prepared with the same method, except without the IMAC step. 
Crystallization. Initial conditions were identified using the high-throughput 
MRC-LMB crystallization facility. All CetZ crystals were produced in 1 µl protein 
drops containing 50% protein and 50% (v/v) precipitant solutions. H. volcanii 
CetZ1 was crystallized in 20% (w/v) polyethylene glycol Mr 6000, 1 M LiCl, 
100 mM sodium citrate (pH 4.0). H. volcanii CetZ2 was crystallized in 3-3.5 M 
ammonium sulphate, 100 mM sodium citrate (pH 5.0). M. thermophila CetZ 
crystallized in 30% (w/v) 1,2-propanediol, 20% (v/v) polyethylene glycol (Mr 400), 
0.1 M Na-HEPES (pH 7.5). Artificial mother liquor supplemented to 25% (w/v) 
glycerol was used as a cryo-protectant for all proteins during X-ray diffraction 
image acquisition. 

Crystallographic data collection and structure determination. Diffraction 
images were collected from single frozen crystals at European Synchrotron Radiation 
Facility beamline ID23eh1 (H. volcanii CetZ1 and M. thermophila CetZ) or Diamond 
beamline I03 (H. volcanii CetZ2) at 100 K. The wavelengths used for data collection 
were 0.9 Å and 0.9795 Å at the respective beamlines. Diffraction images were 
processed with XDS, POINTLESS and SCALA software. Initial phases were 
determined by molecular replacement using PHASER. Models were built manually 
using MAIN and refined using REFMAC and PHENIX.REFINE. Their respective 
Ramachandran plots were used to validate the structures. The atomic model for 
H. volcanii CetZ1 placed 99.7% of residues in the most favoured (98%) region; H. 
volcanii CetZ2 had 98.2%, and M. thermophila CetZ had 99.1%. No residues from 
any of the three structures had angles lying in the disallowed (0.2%) regions. The 
density connecting the helical C-terminal residues of CetZ2 to the main body of the 
protein was weak and could not be threaded; therefore, the ordered position of a 
disconnected C terminus from the adjacent subunit was deposited as a separate 
chain (chain B). Crystallographic data are given in Extended Data Table 1. 

H. volcanii media and culture conditions. Most cultures were grown with Hv-Ca 
medium modified to contain an expanded trace-elements solution that improved 
growth and cell-shape uniformity. The trace-elements stock solution contained, 
per litre: 5 g EDTA, 0.8 g FeCl3, 0.05 g ZnCl2, 0.01 g CuCl2, 0.01 g CoCl2, 0.01 g 
H3BO3, 1.6 g MnCl2, 0.01 g Ni2SO4 and 0.01 g H2MoO4. The solution was adjusted 
to pH 7.0 with NaOH, filter sterilized and stored at room temperature. It was diluted 
100-fold in media to obtain the working concentration. Where necessary, media 
was supplemented with auxotrophic requirements, or the indicated concentration 
of L-tryptophan (Sigma). Hv-YPC medium was used in some experiments, where 
indicated, and in genetic modification procedures. For most experiments, cultures 
were incubated at 45 °C with rotary shaking (150 rpm), and were maintained in log 
growth (D600 < 0.8) for at least 2 days before sampling (at D600 = 0.2-0.8), unless 
otherwise indicated. In addition to preliminary experiments carried out for this 
study, the data displayed are representative of at least two biological 
replicate experiments. 

Strain and plasmid construction. Strain and plasmid construction information 
and relevant oligonucleotide sequences are given in Extended Data Table 2. Plasmids 
for controlled expression in H. volcanii, containing the tna promoter (Ptna), were 
based on pTA962. The open reading frame (ORF) encoding the smRS-GFP variant 
was amplified from pJAM1020 (ref. 41), using overlap extension PCR to remove 
the internal NdeI site, and cloned between BamHI and NotI of pTA962, creating the 
vector pIDJL40 (see Extended Data Table 2). The ftsZ1(D250A) and various cetZ 
ORFs were PCR amplified from H. volcanii DS70 (refs 42, 43) DNA, and then cloned 
between NdeI and BamHI in both pTA962 and pIDJL40. An mCherry ORF was 
cloned between BamHI and NotI in pIDJL40-cetZ1, replacing gfp, to create pIDJL117 
(Extended Data Table 2). Point mutations were generated by overlap-extension PCR 
and fragments were cloned as above, incorporating the following codon substitu- 
tions: ftsZ1(D250A) (GAC > GCC), cetZ1(E218A) (GAG > GCG) and gfp(A206K) 
(GCC > AAG). All PCR-generated fragments were sequence-verified after cloning. 
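
The codon substitutions listed above can be checked against the standard genetic code. The following is a minimal sketch, not part of the original methods: the dictionary covers only the five codons named in the text, and the helper names (`encodes_change`, `base_changes`) are hypothetical.

```python
# Subset of the standard genetic code covering only the codons named in the text.
CODON_TO_AA = {
    "GAC": "D",  # aspartate
    "GCC": "A",  # alanine
    "GAG": "E",  # glutamate
    "GCG": "A",  # alanine
    "AAG": "K",  # lysine
}

# (gene, mutation label, original codon, mutant codon) as listed in the Methods.
SUBSTITUTIONS = [
    ("ftsZ1", "D250A", "GAC", "GCC"),
    ("cetZ1", "E218A", "GAG", "GCG"),
    ("gfp", "A206K", "GCC", "AAG"),
]


def encodes_change(label, old_codon, new_codon):
    """True if the codon change encodes the amino-acid change named in `label`.

    `label` is of the form <old residue><position><new residue>, e.g. 'E218A'.
    """
    return (CODON_TO_AA[old_codon] == label[0]
            and CODON_TO_AA[new_codon] == label[-1])


def base_changes(old_codon, new_codon):
    """Number of nucleotide positions that differ between two codons."""
    return sum(a != b for a, b in zip(old_codon, new_codon))


for gene, label, old, new in SUBSTITUTIONS:
    print(f"{gene}({label}): {old} > {new}, "
          f"{base_changes(old, new)} base change(s), "
          f"consistent={encodes_change(label, old, new)}")
```

Under these assumptions, each listed codon change encodes the stated amino-acid substitution; the two alanine substitutions are single-base changes, whereas A206K alters all three bases of the codon.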

Plasmids for making deletions in H. volcanii were constructed by cloning upstream 
and downstream DNA fragments that flanked each deletion into pTA131 (ref. 40). 
Two stages were used to generate each cetZ-deletion plasmid. In the first stage, the 
upstream DNA fragment amplified from DS70 (primer pairs US-f/US-r, Extended 
Data Table 2) was spliced by overlap-extension PCR to the corresponding Ptna-cetZ 
fragment that had been amplified with primer PtnaUS-f 
(CTGGCGAAAGGGGGATGTGCTGC), the cetZ ORF reverse primer, and the expression-plasmid 
template (Extended Data Table 2). The spliced product, in which the native cetZ 
promoter was swapped for Ptna, was cloned between the HindIII-BamHI sites of 
pTA131. These plasmids were demethylated by passage through E. coli C2925 
(New England Biolabs), and then used in the ‘pop-in pop-out’ method, to replace 
the chromosomal cetZ promoters with Ptna. Transformants were selected as previously 
described, and then chromosome structures were verified by allele-specific 
PCR. Preliminary experiments using the resulting strains showed no significant 
growth defects upon depletion of cetZ gene expression by withdrawal of Trp during 
mid-log growth. In the second stage, the BglII-BamHI (Ptna-cetZ) fragment in 
each plasmid was replaced with a downstream fragment (amplified with primers 
DS-f/DS-r, Extended Data Table 2), to create plasmids for deletion of each cetZ 
ORF, as described above. The resulting strains are listed in Extended Data Table 2. 
Western blotting. CetZ1 rabbit antisera were generated with a synthetic peptide 
antigen derived from the sequence of the C-terminal region of CetZ1: [C]- 
QDNIEEIRQESDSNLETLIN-amide (Cambridge Research Biochemicals, UK). 
Antisera were assayed and verified for specificity (Fig. 2a). H. volcanii cells were 
resuspended in SDS-PAGE sample buffer and then heated (95 °C, 5 min) before 
analysis by SDS-PAGE and western blotting (10-* primary antisera dilution) 
using a Criterion electrophoresis system (Bio-Rad) with nitrocellulose membrane 
(Protran, Whatman) and Chemiluminescence detection (Thermo Scientific) 
according to standard protocols. 

Coulter counting. Culture samples were diluted (10°? or 10 *) with 18% buffered 
salt water (BSW) (which included calcium)*°, and then ~ 10° cells were analysed 
with a Multisizer 4 Coulter Counter (Beckman Coulter) in the volumetric mode 
(100 µl), with a 20 µm aperture tube and 18% BSW as the electrolyte. A 2 µm bead 
standard (Beckman Coulter), diluted in 18% BSW, was used for prior calibration. 
Motility assays. One microlitre of a mid-log culture in Hv-Ca was inoculated into 
the subsurface of an Hv-Ca + 0.3% (w/v) agar plate (20 ml of media per plate), 
with up to four inoculations per plate from the same liquid culture. Equal cell 
densities of the inoculum were used for comparison of multiple strains. Plates were 
incubated with minimal disturbance in a closed plastic bag at 45 °C for 5 days”, 
unless otherwise indicated, and then images of the plates were acquired using a 
densitometer/scanner. 

Light microscopy. For most microscopy, a 1 µl sample of culture was placed 
directly onto a slide prepared with a ~170 µm thick, 1% agarose pad containing 
18% BSW. A number 1.5 coverslip was then placed on top. Phase-contrast and 
GFP epifluorescence images were acquired with a Zeiss AxioPlan 2 system with a 
1.4 NA phase-contrast objective (Carl Zeiss, Germany). 

Time-lapse imaging was performed at 37 °C, with a Nikon Ti with the Perfect 
Focus system and EMCCD camera (Cascade 1K, Photometrics), or the OMX 
superresolution system with wide-field deconvolution, described further below. 
Samples were prepared by placing a 0.5-1 µl sample of growing culture at the 
centre of a ~0.5-mm-thick gel pad (containing media semi-solidified with 0.2-0.3% 
agarose or agar) prepared on a coverslip (circular, 8 mm diameter, number 1.5); 
this was then inverted onto the base of a 35 mm number 1.5 FluoroDish (World 
Precision Instruments, Inc.), and then 3 ml of liquid media was added to cover the 
pad assembly. The lid was replaced and the dish was then incubated on the micro- 
scope stage (at 37 °C) for 30 min before imaging. 

Super-resolution imaging was performed with a V3 DeltaVision OMX 3D-SIM 
Blaze system (Applied Precision Inc., a GE Healthcare Company, Issaquah, USA). 
A solid-state 488 nm wavelength laser provided wide-field illumination and 
images were captured using a 60× 1.4 NA UPlanSApo oil objective (Olympus, 
Japan), a standard 488 nm excitation and 500-550 nm emission filter set and a 
scientific CMOS 512 × 512 pixel 15-bit camera (pco.edge, PCO AG, Kelheim, 
Germany). For wide-field deconvolution imaging, samples were sectioned using 
a 250 nm step-size and deconvolved using SoftWorX (Applied Precision, GE). For 
three-dimensional structured-illumination microscopy (3D-SIM), interference 
patterns were generated by interfering light beams and samples were sectioned 
using a 125 nm z-step size. Raw 3-phase images were then reconstructed as 
previously described. Deconvolved or reconstructed image stacks were 3D rendered 
and presented using IMARIS software (v7.6.4, Bitplane Scientific). Video was 
prepared using ImageJ. 

Automated image analysis. Phase-contrast and GFP epifluorescence images were 
analysed using Mathematica software (Wolfram Research). Phase-contrast images 
were first binarized by using a bottom-hat transform with a 30-pixel disk matrix 
for the kernel. Individual objects (cells) were then identified by using the ‘Mor- 
phological Components’ function. Background objects were excluded by three 
sanity checks using the ‘Component Measurements’ function: (1) the minimum 
cell size was 200 pixels (1 pixel = 60 nm) with a bounding disk radius of 4 pixels; 
(2) fragmented objects were excluded if the ratio of areas calculated by the pixel 
count and the ‘Area Measurement’ parameter lay outside 0.95-1.05; (3) cells had to 
contain no holes. The circularity of each cell was calculated as the bounding disk 
coverage, that is, the percentage area occupied by a cell within the minimal circle 
that completely contains the cell outline, by using the ‘Component Measurements’ 
function. To measure cell curvature, a curve was parametrically fitted to the outline 
of each cell by two Fourier expansions up to the 10th order mode; the relative 
standard deviation (RSD) of curvature was calculated from this line. 
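
The two shape measures described above can be illustrated with a short sketch. This is not the authors' Mathematica code: it assumes the outline is available as ordered (x, y) points sampled around a closed curve, computes the bounding-disk coverage from the polygon area and a smallest enclosing circle, and takes the RSD of curvature as the standard deviation divided by the absolute mean over evenly spaced samples of a Fourier fit truncated at the 10th mode. All function names are hypothetical.

```python
import numpy as np


def curvature_rsd(x, y, order=10):
    """RSD of curvature for a closed outline, from a truncated Fourier fit."""
    x, y = np.asarray(x, float), np.asarray(y, float)
    n = len(x)                                   # requires n > 2 * order
    m = np.fft.fftfreq(n, d=1.0 / n)             # integer mode numbers
    keep = np.abs(m) <= order                    # discard modes above `order`
    X, Y = np.fft.fft(x) * keep, np.fft.fft(y) * keep
    # Analytic derivatives of the fitted curve with respect to t in [0, 2*pi).
    dx, dy = np.fft.ifft(1j * m * X).real, np.fft.ifft(1j * m * Y).real
    ddx, ddy = np.fft.ifft(-(m ** 2) * X).real, np.fft.ifft(-(m ** 2) * Y).real
    kappa = (dx * ddy - dy * ddx) / (dx ** 2 + dy ** 2) ** 1.5
    return kappa.std() / abs(kappa.mean())


def _inside(c, r, p, eps=1e-9):
    return np.hypot(*(p - c)) <= r + eps


def _circle_from_two(p, q):
    c = (p + q) / 2.0
    return c, np.linalg.norm(p - c)


def _circle_from_three(p, q, r):
    (ax, ay), (bx, by), (cx, cy) = p, q, r
    d = 2.0 * (ax * (by - cy) + bx * (cy - ay) + cx * (ay - by))
    if abs(d) < 1e-12:                           # (near-)collinear fallback
        pairs = [(p, q), (p, r), (q, r)]
        a, b = max(pairs, key=lambda ab: np.linalg.norm(ab[0] - ab[1]))
        return _circle_from_two(a, b)
    pa, pb, pc = ax * ax + ay * ay, bx * bx + by * by, cx * cx + cy * cy
    ux = (pa * (by - cy) + pb * (cy - ay) + pc * (ay - by)) / d
    uy = (pa * (cx - bx) + pb * (ax - cx) + pc * (bx - ax)) / d
    c = np.array([ux, uy])
    return c, np.linalg.norm(p - c)


def min_enclosing_circle(points, seed=0):
    """Smallest circle containing all points (randomized incremental method)."""
    pts = np.asarray(points, float)
    pts = pts[np.random.default_rng(seed).permutation(len(pts))]
    c, r = pts[0], 0.0
    for i in range(1, len(pts)):
        if _inside(c, r, pts[i]):
            continue
        c, r = pts[i], 0.0
        for j in range(i):
            if _inside(c, r, pts[j]):
                continue
            c, r = _circle_from_two(pts[i], pts[j])
            for k in range(j):
                if not _inside(c, r, pts[k]):
                    c, r = _circle_from_three(pts[i], pts[j], pts[k])
    return c, r


def bounding_disk_coverage(x, y):
    """Percentage of the smallest enclosing circle occupied by the outline."""
    x, y = np.asarray(x, float), np.asarray(y, float)
    area = 0.5 * abs(np.sum(x * np.roll(y, -1) - np.roll(x, -1) * y))  # shoelace
    _, r = min_enclosing_circle(np.column_stack([x, y]))
    return 100.0 * area / (np.pi * r ** 2)
```

With these definitions a circular outline gives a coverage close to 100% and a curvature RSD close to zero, whereas elongated or irregular outlines give lower coverage and higher RSD.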

To measure fluorescence association with the cell envelope, epifluorescence 
images were overlaid with cell outlines from each corresponding phase-contrast 
image. Fluorescence was then integrated along the normal of the parametric fit of 
the outline for 3 pixels towards the centre of the cell (Extended Data Fig. 7). In cells 
displaying localization of fluorescence (RSD > 0.5), the mean fluorescence in areas 
of high curvature (>2 standard deviations above the mean) was compared to the 
mean fluorescence intensity of the full perimeter and to the mean of the whole cell. 
The mean percentage difference between each set of two values across all cells was 
calculated. 
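
The per-cell comparison described above can be sketched as follows. This is an illustration under stated assumptions rather than the authors' implementation: it assumes that curvature and the 3-pixel integrated fluorescence are already available for each outline point, that the whole-cell mean fluorescence is supplied separately, and that "percentage difference" means 100 × (a − b) / b. The function name `envelope_enrichment` is hypothetical.

```python
import numpy as np


def envelope_enrichment(curvature, envelope_fluor, whole_cell_mean, n_sd=2.0):
    """Percentage differences in mean fluorescence for one cell.

    curvature       : curvature at each outline point
    envelope_fluor  : fluorescence integrated along the inward normal at each point
    whole_cell_mean : mean fluorescence over the whole cell area
    n_sd            : high-curvature threshold, in standard deviations above the mean
    """
    curvature = np.asarray(curvature, float)
    envelope_fluor = np.asarray(envelope_fluor, float)
    threshold = curvature.mean() + n_sd * curvature.std()
    high = curvature > threshold
    if not high.any():
        return None  # no high-curvature region in this cell

    high_mean = envelope_fluor[high].mean()
    perimeter_mean = envelope_fluor.mean()

    def pct(a, b):
        return 100.0 * (a - b) / b

    return {
        "high_vs_cell": pct(high_mean, whole_cell_mean),
        "high_vs_perimeter": pct(high_mean, perimeter_mean),
        "perimeter_vs_cell": pct(perimeter_mean, whole_cell_mean),
    }


def mean_enrichment(per_cell_results):
    """Average each percentage difference across cells, skipping cells without a result."""
    valid = [r for r in per_cell_results if r is not None]
    return {key: float(np.mean([r[key] for r in valid])) for key in valid[0]}
```

Averaging the three values across all cells that pass the localization filter would give population-level figures of the kind reported for CetZ1(E218A)-GFP.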
Electron cryotomography. H. volcanii were grown in Hv-YPC medium + 4 mM 
Trp. Mid-log cells were concentrated 10-20-fold by centrifugation and resuspen- 
sion, mixed with 10-nm protein-A-coated colloidal gold particles (Sigma), and 
then plunge-frozen on Quantifoil R3.5/1 holey carbon-coated transmission elec- 
tron microscopy grids using a Vitrobot (FEI Company, USA). The grids were then 
stored under liquid nitrogen. Imaging was performed at 300 kV on an FEI Tecnai 
G2 Polara transmission electron microscope equipped with a Gatan energy filter 
(zero-loss peak mode and 20 eV slit width) and a post-GIF 4 × 4k Gatan Ultrascan 
camera binned to 2 × 2k (Gatan, USA). Tilt series were collected around a single 
axis at 1.5° increments between ±42° at a pixel size of 0.58 nm or 0.7 nm (at the 
specimen level) using SerialEM software. The defocus was set at −12 µm and the 
cumulative dose was <80 e Å⁻². Tomographic reconstructions were calculated 
using the IMOD tomography package. 
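
As a worked example of the acquisition scheme, the sketch below counts the tilt images implied by 1.5° increments over a symmetric range and estimates an upper bound on the dose per image. It assumes the tilt range was −42° to +42° inclusive and that the cumulative dose was split evenly across images; neither assumption is stated explicitly in the text.

```python
import numpy as np

max_tilt_deg = 42.0        # assumed symmetric tilt range, -42 deg to +42 deg
step_deg = 1.5             # tilt increment from the Methods
cumulative_dose = 80.0     # upper bound on total dose, electrons per square angstrom

# Half a step is added to the stop value so that +42 deg is included.
tilt_angles = np.arange(-max_tilt_deg, max_tilt_deg + step_deg / 2, step_deg)
n_images = len(tilt_angles)                 # 57 tilt angles

# Upper bound on dose per image, assuming an even split across the series.
dose_per_image = cumulative_dose / n_images

print(f"{n_images} images, < {dose_per_image:.2f} e per square angstrom each")
```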
Extended Data Figure 1 | CetZ2-GTP-γS crystals contain protofilaments 
and 2D sheets. a, Expansion of crystal symmetry (PDB 4B45) shows lateral 
association of tubulin/FtsZ-like protofilaments. The lateral repeat distance 
of 42.4 Å is slightly shorter than the longitudinal distance (43.2 Å). There is 
similar packing to the lattice of tubulin dimers in the microtubule wall, 
although a specific M-loop facilitates the lateral contact in tubulin. CetZ2 shows 
less intimate contacts between GTPase domains (green) in the protofilament 
compared to tubulin. b, 90°-rotated view along the protofilaments. 
Protofilaments are arranged in the same orientation, creating two different 
surfaces of the sheet. The H11 helix and C termini are located on the lower 
surface as shown, and would have the potential to interact with other molecules 
or surfaces (as seen with the C-terminal tail of FtsZ). 
Extended Data Figure 2 | Tubulin-superfamily proteins in Euryarchaeota 
that contain CetZ. CetZ1 from H. volcanii (HVO_2204, Uniprot D4GVD7) 
was used in PSI-BLAST searches to identify tubulin-superfamily members in 
Euryarchaeota. A representative genome from all genera determined to 
contain a CetZ or non-canonical CetZ was selected (if available), and then all 
identified superfamily members from these genomes were aligned for 
generating the phylogram. Bootstrap support is shown for selected branches. 
The positions of the H. volcanii CetZ proteins are indicated (HvCetZ). All 
species identified to contain a CetZ contained at least one FtsZ; most contained 
two FtsZ proteins from distinct sub-families. CetZ1 orthologues were identified 
in all species of Halobacteria analysed, and typically shared ~80% sequence 
identity. CetZ2 orthologues were identified in many of these. A number of 
highly diverse sequences containing recognizable tubulin-superfamily 
signature motifs (Fig. 1b) were identified in Euryarchaeota, including the 
classes Thermococci, Methanomicrobia and Halobacteria (notably, in the 
rectangular prism-shaped Haloquadratum walsbyi (Hwa)). These were difficult 
to classify robustly; however, most were found to branch from near the base of 
the CetZ-family bough and were referred to as non-canonical CetZs. 
Extended Data Figure 3 | The role of CetZ1 in H. volcanii motility. 
a, b, Motility assays of the ΔcetZ1 strain (ID59+pTA962) were performed with 
closely-spaced (a) or standard-spaced (b) inoculation sites. Exclusion zones 
were observed between halos after extended incubation (16 days). Irregularities 
in halo structures were seen in wild-type and mutant halos, which may be 
related to colony exclusion (repulsion or inhibition) between local clouds of 
motile cells. c, Motility agar plates (containing 0.2 mM Trp), were imaged once 
daily over 4 days after inoculation. Strains containing wild-type CetZ1 
(H98+pTA962 and H98+pTA962-cetZ1) show an abrupt development of 
motile halos from day 3. Strains containing cetZ1 mutations (H98+pTA962- 
cetZ1(E218A) and ID59+pTA962 (ΔcetZ1)), were strongly inhibited in 
motility. d, A halo of wild-type H. volcanii (H98+pTA962) is shown in low and 
high contrast, showing the high-motility fainter outer zone and the denser 
population of cells growing more centrally. Samples were withdrawn from the 
indicated positions (1-4) of the motility halo for microscopy. e, f, Cell 
circularity distributions (e) were then generated from automated computer 
analysis of images of cells (f) sampled at the four positions. 
Extended Data Figure 4 | H. volcanii cell shape is sensitive to CetZ1 and 
CetZ1(E218A) concentration. a, Phase-contrast images of wild-type 
(H98+pTA962), CetZ1-overproduction (H98+pTA962-cetZ1) and 
CetZ1(E218A)-overproduction (H98+pTA962-cetZ1(E218A)) strains in mid- 
log growth in Hv-Ca medium containing the indicated concentrations of Trp. 
b, c, Images of the strains were analysed for cell circularity (b) and the 
relative standard deviation (RSD) of curvature (c) of cell outlines. The wild-type 
(WT) control (H98+pTA962, 4 mM Trp) is shown in both b and c. At greater 
than 2 mM Trp, the circularity and curvature parameters did not increase 
further. However, during CetZ1(E218A) overproduction (4 mM Trp) cells 
appeared to show slightly larger highly distorted cells with more noticeable 
ruffling and protrusions (a, bottom right image). d, Coulter cell volume analysis 
of the strains confirmed that CetZ1(E218A) overproduction (4 mM Trp) 
showed slightly larger cells, indicating delayed cell division, a possible effect of 
significant cell distortion. e, Western blot analysis of H98+pTA962-cetZ1 
(+CetZ1) and H98+pTA962-cetZ1(E218A) (+E218A) sampled during 
mid-log growth in Hv-Ca medium with the indicated concentrations of Trp. 
Ponceau S pre-staining of the membrane for total protein, as a loading 
reference, is shown below the CetZ1 western blot. The levels of CetZ1 increase 
in response to increasing concentrations of Trp in the medium. CetZ1(E218A) 
showed consistently higher levels compared to wild-type CetZ1, consistent 
with protofilament hyper-stability resulting in a slower rate of in vivo 
degradation of CetZ1(E218A) (see also Fig. 2a). 
Extended Data Figure 5 | Localization of CetZ1-GFP and variants, 
CetZ1-GFP(A206K) and CetZ1-mCherry, in H. volcanii. CetZ1 tagged with the 
GFP variant used primarily in this study, smRS-GFP, was compared to CetZ1 
tagged with the A206K mutant of smRS-GFP (which blocks GFP self-association) 
and the mCherry protein (which shows red fluorescence in H. volcanii). Strains 
expressing the indicated CetZ1 fusion proteins in pTA962-based plasmids were 
sampled during mid-log phase growth in Hv-Ca + 0.2 mM Trp (log-phase cells) 
and from the leading edge of motile halos (0.2 mM Trp). 
Extended Data Figure 6 | Expression of CetZ1-GFP and CetZ1(E218A)-GFP 
in H. volcanii. a, Phase-contrast (left), GFP-fluorescence (middle), and 
phase/GFP overlay (right) images of strains H98+pIDJL40 (GFP vector only), 
H98+pIDJL40-cetZ1 (+CetZ1-GFP) and H98+pIDJL40-cetZ1(E218A) 
(+CetZ1(E218A)-GFP) sampled from mid-log growth in strong-overproduction 
conditions (Hv-Ca + 4 mM Trp). The GFP-vector-only 
fluorescence image is high-contrast, showing low-level background GFP 
fluorescence. The CetZ1-GFP and CetZ1(E218A)-GFP fluorescence images 
are presented with identical exposure and contrast settings, showing relatively 
higher background fluorescence for CetZ1-GFP and higher localized 
fluorescence for CetZ1(E218A)-GFP, indicating that assembly of the 
localized protein is stabilized by the E218A mutation (in the GTPase domain of 
CetZ). b, Cell circularity distributions for the indicated strains show no 
stimulation of rod shape during overproduction of either CetZ1-GFP or 
CetZ1(E218A)-GFP. c, Cell curvature distributions indicate that 
overproduction of CetZ1(E218A)-GFP results in blocky, stalked cells, as seen 
with untagged CetZ1(E218A) (Extended Data Fig. 4). d, Western blot analysis 
of CetZ1 in the indicated strains, showing increasing expression of CetZ1-GFP 
fusion proteins in response to Trp concentration. e, Motility assays for the 
indicated strains (H98 background, 0.2 mM Trp), showing significant 
inhibition of motility by CetZ1(E218A); this may result from higher level 
production of this protein (see Extended Data Fig. 4e), or stronger inhibition of 
native CetZ1 function. f, Cell circularity analysis of cells at the leading edge 
of the halos in e suggests that strong inhibition of rod shape is needed to see an 
obvious reduction in motility. (Data for wild-type and +CetZ1(E218A) 
strains are reproduced from Fig. 2h.) g, Cell outlines resulting from the 
automated cell detection process in phase-contrast and the corresponding 
GFP-fluorescence images. Magnified images show the normal lines (3 pixels 
long) on the inside of the cell outline that were integrated to measure 
fluorescence near the envelope (the closely spaced normal lines give the 
appearance of a thick outline of the cell). Sections of high curvature 
(>2 standard deviations above the mean) are shown in green, whereas the 
remainder is shown in red. h, A plot of the curvature (red) and the GFP 
localization (green) of the magnified cell, measured anticlockwise from the 
arrowhead. In automated analysis of cell populations, the curvature of each 
whole cell was calculated as the relative standard deviation of this line. Analysis 
of the strain expressing CetZ1(E218A)-GFP (in Hv-Ca + 2 mM Trp) showed a 
73% greater mean fluorescence in areas of high curvature versus the whole cell, 
53% greater fluorescence for areas of high curvature versus the whole perimeter, 
and 10% greater fluorescence for the perimeter versus the whole cell. 
Extended Data Figure 7 | Electron cryotomography of H. volcanii. Cells 
growing in Hv-YPC + 4 mM Trp were frozen in vitreous ice to maintain native 
structure, and then tomographic data were acquired with a 300 kV 
cryo-transmission electron microscope. A 10 nm section from the tomographic 
reconstruction of the whole cell is shown for each. a, H98+pTA962 (wild-type) 
cell, showing cell envelope layers with some structural detail of the outer S-layer 
visible, a dense granule, particles consistent with ribosomes visible in the 
outer ~25% of the cytoplasmic region, and fibres that may be seen underlying 
the cell envelope. b, Wild-type cell, with detail of the cell envelope region (inset 
as per Fig. 4f). c, Cells overproducing CetZ1(E218A) (H98+pTA962- 
cetZ1(E218A)), show extensive sections of the envelope with underlying fibres 
(inset as per Fig. 4f). (Supplementary Video 5 shows a segment image series 
through a cell, showing the sheet-like structure of the extra layer.) d, Example 
showing subtle ruffling of the cell envelope associated with the additional 
envelope fibres in a cell producing CetZ1(E218A). Scale bars, 100 nm for all 
panels, and 50 nm for all insets. 
Extended Data Figure 8 | Expression of CetZ2(E212A) affects H. volcanii 
cell shape and motility. The T7-loop mutation CetZ2(E212A) is equivalent to 
CetZ1(E218A). a, Expression of CetZ2(E212A) in motility assays, by 
addition of increasing concentrations of Trp, inhibits motility, but to a lesser 
degree than CetZ1(E218A). b, Cells were withdrawn from the leading edge of 
the indicated motile halos. Moderate expression of CetZ2(E212A) (0.2 mM 
Trp) partially inhibits rod development, and higher expression (1 mM Trp) 
strongly inhibits rod formation and produces cells with sharp corners and 
regions of high curvature, similar to cells expressing CetZ1(E218A) (Fig. 4). 
CetZ2(E218A) is therefore dominant-inhibitory to rod development. c, Coulter 
cell volume analysis of cells withdrawn from mid-log cultures of wild-type 
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(H98 + pTA962), CetZ2-overproducing (H98 + pTA962-cetZ2) and 
CetZ2(E212A)-overproducing (H98 + pTA962-cetZ2(E212A)) strains shows 
that overexpression of CetZ2 or CetZ2(E212A) does not affect cell division. 
d, Phase-contrast images of cells from the same cultures shown in c. Data 
obtained from automated analysis of images for each strain is shown in the 
plots in d. Unlike CetZ1 (Fig. 4), overexpression of CetZ2 in liquid culture did 
not stimulate rod morphology, indicating that CetZ2 is not a dominant driver 
of rod formation. However, analysis of cell curvature (bottom right) showed 
that expression of CetZ2(E212A) produced a high-curvature phenotype, 
similar to CetZ1(E218A) (Fig. 4), indicating that this mutant specifically 
interferes with shape regulation in H. volcanii. 
Extended Data Table 1 | Crystallography data collection and refinement statistics

                        CetZ2–GTPγS         CetZ1–GDP           Mth CetZ–GDP
                        (PDB 4B45)          (PDB 4B46)          (PDB 3ZID)
Data collection
Space group             P1                  P2₁2₁2₁             P2₁
Cell dimensions
  a, b, c (Å)           42.4, 43.2, 48.5    56.3, 62.2, 96.5    44.6, 107.0, 77.0
  α, β, γ (°)           68.9, 75.3, 86.9    90.0, 90.0, 90.0    90.0, 91.42, 90.0
Resolution (Å)          40.3–2.0            48.7–1.9            50.0–2.0
Rsym or Rmerge*         0.13 (0.94)         0.11 (0.58)         0.04 (0.66)
I/σI*†                  8.1 (1.5)           10.0 (1.9)          11.4 (1.5)
Completeness (%)*       97.9 (96.9)         99.8 (99.4)         92.9 (92.3)
Redundancy*             3.5 (3.6)           4.2 (3.3)           1.8 (1.8)

Refinement
Resolution (Å)          2.1                 1.9                 2.0
No. reflections‡        34258               50705               74471
Rwork / Rfree‡§         0.18 (0.23)         0.17 (0.21)         0.18 (0.23)
No. atoms
  Protein               2502                2460                5531
  Ligand/ion            32                  28                  56
  Water                 99                  197                 185
B-factors||
  Protein               55.82               28.33               39.34
  Ligand/ion            49.64               18.32               22.70
  Water                 53.64               36.37               41.89
R.M.S. deviations
  Bond lengths (Å)      0.007               0.006               0.008
  Bond angles (°)       1.09                1.04                1.21

* The values recorded in parentheses refer to the highest resolution shell of the data set collected.
† Diffraction data were truncated at the resolution at which the half-set correlation coefficient fell below 0.7.
‡ The values quoted are those calculated and output by PHENIX.REFINE during the final refinement run.
§ A set representing 5% of the reflections collected was randomly selected before refinement.
|| Mean atomic B-factors for each class of atom described in the final, deposited model.
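
As a reading aid for the cell dimensions in Extended Data Table 1, the following minimal sketch computes unit-cell volumes from the tabulated a, b, c and α, β, γ using the general (triclinic) formula. It is an illustration only and was not part of the structure determination; the function name `unit_cell_volume` and the dictionary labels are assumptions introduced here.

```python
import math


def unit_cell_volume(a, b, c, alpha, beta, gamma):
    """Volume (cubic angstroms) of a general unit cell; angles in degrees.

    Uses V = abc * sqrt(1 - cos^2(alpha) - cos^2(beta) - cos^2(gamma)
                        + 2 cos(alpha) cos(beta) cos(gamma)).
    """
    ca, cb, cg = (math.cos(math.radians(x)) for x in (alpha, beta, gamma))
    return a * b * c * math.sqrt(1 - ca**2 - cb**2 - cg**2 + 2 * ca * cb * cg)


# Cell dimensions copied from Extended Data Table 1 (labels are illustrative).
cells = {
    "CetZ2-GTPgammaS (PDB 4B45)": (42.4, 43.2, 48.5, 68.9, 75.3, 86.9),
    "CetZ1-GDP (PDB 4B46)": (56.3, 62.2, 96.5, 90.0, 90.0, 90.0),
    "Mth CetZ-GDP (PDB 3ZID)": (44.6, 107.0, 77.0, 90.0, 91.42, 90.0),
}

for name, params in cells.items():
    print(f"{name}: {unit_cell_volume(*params):.0f} cubic angstroms")
```

For the orthorhombic CetZ1 cell all angles are 90°, so the formula reduces to the simple product a × b × c.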
Extended Data Table 2 | H. volcanii strains, plasmids and oligonucleotides

Strain       Genotype              Description*                                      Source
             Oligonucleotides used for construction (5' to 3')†

DS70         (DS2) ΔpHV2           Wild-type H. volcanii cured of pHV2               T. Allers
H53          (DS70) ΔpyrE2 ΔtrpA   Auxotroph                                         T. Allers
H98          (DS70) ΔpyrE2 ΔhdrB   Auxotroph                                         T. Allers
ID59         (H98) ΔcetZ1          Deletion of cetZ1 (HVO_2204, Uniprot: D4GVD7)     This study
             1US-f: CCGGCCAAGCTTGCGAGTTCGTCTCCTTCACGA
             1US-r: GCAGCACATCCCCCTTTCGCCAGATCTATGGGGACGTGTCGAACCAC
             1DS-f: CCCCCAGATCTCTGGAGTCGCTTTTCTAAGCTGGC
             1DS-r: CCCCCGGATCCACGTCTGCTCGGCTTGTTGC
[illegible]  (H98) ΔcetZ2          Deletion of cetZ2 (locus and Uniprot [illegible]) This study
             2US-f: CCGGCCAAGCTTGGGAGCCACTACCGTCACCAC
             2US-r: GCAGCACATCCCCCTTTCGCCAGATCTCCTCACATGGGGACTGTTATCTC
             2DS-f: CCCCCAGATCTAACCGACTCCTGTCGTTTGC
             2DS-r: CCCCCGGATCCGTCCGGTCGCTCTCCAGTTC
[illegible]  (H98) ΔcetZ3          Deletion of cetZ3 (HVO_1113, Uniprot: D4GW48)     This study
             3US-f: CCGGCCAAGCTTGCTCAACGAGGGCGACACC
             3US-r: GCAGCACATCCCCCTTTCGCCAGATCTGTGGGTAGGGCTGTGCAAC
             3DS-f: CCCCCAGATCTGCTGTTCTGACCGTCTTCGTC
             3DS-r: CCCCCGGATCCCGAAGTCCTCCCGAGAGTCG
ID61         (H98) ΔcetZ4          Deletion of cetZ4 (HVO_A0035, Uniprot: D4GQ67)    This study
             4US-f: CCGGCCAAGCTTGCTGACTGAGTTCGTTCGTTCTG
             4US-r: GCAGCACATCCCCCTTTCGCCAGATCTATGGGACCACCCTACCTACCC
             4DS-f: CCCCCAGATCTTTTTTAATGGATACCCCTGTCTCG
             4DS-r: CCCCCGGATCCGAACACGCAACCGGCATTC
ID85         (H98) ΔcetZ5          Deletion of cetZ5 (HVO_2013, Uniprot: D4GU26)     This study
             5US-f: CCGGCCAAGCTTGTCCTCCCCGTTTTTCAGTTGC
             5US-r: GCAGCACATCCCCCTTTCGCCAGATCTTGACACGTCATCAGTGGTGAATC
             5DS-f: CCCCCAGATCTCCGACCGCCTCGACGATAG
             5DS-r: CCCCCGGATCCGTTCGCTACGCTGCCTGCTT
ID64         (H98) ΔcetZ6          Deletion of cetZ6 (HVO_2068, Uniprot: D4GU81)     This study
             6US-f: CCGGCCAAGCTTCGCTATCTCGATATCCCGTTCAA
             6US-r: GCAGCACATCCCCCTTTCGCCAGATCTGTTGTCTGCTCTGCGACGTACC
             6DS-f: CCCCCAGATCTAACGCACGGGGGTCGATAA
             6DS-r: CCCCCGGATCCTCAGCTCCGTCTCCGACTCA
ID71         (H53) ΔcetZ1          As for ID59 (oligonucleotides as for ID59)        This study

Plasmid               Description                                                    Source
                      Oligonucleotides used for construction (5' to 3')

pTA962                Expression vector. tnaA promoter. pyrE2 and hdrB markers.      T. Allers
pIDJL40               GFP-fusion vector based on pTA962.‡                            This study
                      GFP-f: GGCCGGATCCATGAGTAAAGGAGAAGAACTTTTCAC
                      GFP-r: GGAAGAATGCGGCCGCTTATTTGTATAGTTCATCCATGCCATG
pIDJL117              cetZ1-mCherry expression plasmid based on pTA962.§             This study
                      mCh-f: GGCCGGATCCGCTGGCTCCGCTGCTGGTTC
                      mCh-r: GGAAGAATGCGGCCGCTTACTTGTACAGCTCGTCCATGCC
pTA962-ftsZ1.D250A    ftsZ1.D250A expression                                         This study
                      ftsZ1-f: CCCCCGGGAATTCATATGGACTCTATCGTCGGCGACGC
                      ftsZ1-r: CGCGGATCCCTACTCGACGTAGTCGATGTCTTCGAG
                      F1.D250A-f: CCTCGACTTCGCCGCCGTGAAGACCGTCATGG
                      F1.D250A-r: CCATGACGGTCTTCACGGCGGCGAAGTCGAGG
pIDJL40-ftsZ1         ftsZ1-gfp expression                                           This study
                      ftsZ1-f: (shown above)
                      ftsZ1(NS)-r: CGCGGATCCCTCGACGTAGTCGATGTCTTCGAG
pTA962-cetZ1          cetZ1 expression                                               This study
                      cetZ1-f: CCCCCGGGAATTCATATGAAGCTCGCAATGATCGGATTCGGG
                      cetZ1-r: CGCGGATCCTTAGAAAAGCGACTCCAGTTCGTCCTCG
pTA962-cetZ1.E218A    cetZ1.E218A expression                                         This study
                      cetZ1-f: (shown above)
                      cetZ1-r: (shown above)
                      C1.E218A-f: CGTCGTCGACTCCTCGGCGATTATCAACACGC
                      C1.E218A-r: GCGTGTTGATAATCGCCGAGGAGTCGACGACG
pIDJL40-cetZ1         cetZ1-gfp expression                                           This study
                      cetZ1-f: CCCCCGGGAATTCATATGAAGCTCGCAATGATCGGATTCGGG
                      cetZ1(NS)-r: CGCGGATCCGAAAAGCGACTCCAGTTCGTCCTCG
pIDJL40-cetZ1.E218A   cetZ1.E218A-gfp expression                                     This study
                      cetZ1-f and cetZ1(NS)-r (shown above) with pTA962-cetZ1.E218A PCR template
pIDHV262              cetZ1-gfp.A206K expression                                     This study
                      A206K-f: CCTGTCCACACAATCTAAGCTTTCGAAAGATCCC
                      A206K-r: GGGATCTTTCGAAAGCTTAGATTGTGTGGACAGG
pTA962-cetZ2          cetZ2 expression                                               This study
                      cetZ2-f: CCCCCGGGAATTCATATGAAAACCGTCCTGATTGGTGTGGGG
                      cetZ2-r: CGCGGATCCTCACAGCAGGTCGTCGAGGTCGTC
pTA962-cetZ2.E212A    cetZ2.E212A expression                                         This study
                      cetZ2-f: (shown above)
                      cetZ2-r: (shown above)
                      C2.E212A-f: CGTCGACACCTCCGCGGTCATCAACACCCTCC
                      C2.E212A-r: GGAGGGTGTTGATGACCGCGGAGGTGTCGACG

References 21, 40 and 43 are cited in this table.

* Locus identifier (HVO) and Uniprot database (http://www.uniprot.org) codes are given with the proposed cetZ gene names.
† PCR primers shown were used for amplification from DS70 or plasmid DNA as described in the Methods. Relevant restriction enzyme cutting sites are underlined. Complementary primers for generating point mutations were used with the ORF primers in overlap-extension reactions to generate the indicated mutant genes. US/DS-f/r, upstream/downstream-forward/reverse. (NS), no stop codon.
‡ smRS-GFP was obtained from pJAM1020 (ref. 41) (from J. Maupin-Furlow). The internal NdeI site was removed by overlap-extension PCR to incorporate a silent mutation (CATATG to CACATG), before cloning between BamHI and NotI of pTA962.
§ mCherry was obtained from pROD17 (from D. Sherratt). The internal PvuII site was removed by overlap-extension PCR to incorporate a silent mutation (CAGCTG to CAGCTC), and a mutation in the mCherry template was corrected during pIDJL117 construction (primers not shown).
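
The table notes state that complementary primer pairs were used in overlap-extension reactions to generate point mutations. A minimal sketch of how that complementarity could be checked is given below; the helper `reverse_complement` is a hypothetical name introduced for illustration, and the three primer pairs are copied from Extended Data Table 2. This is not the authors' primer-design procedure.

```python
# Translation table mapping each base to its Watson-Crick partner.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    seq = seq.upper()
    unexpected = set(seq) - set("ACGT")
    if unexpected:
        raise ValueError(f"non-ACGT characters in sequence: {sorted(unexpected)}")
    return seq.translate(_COMPLEMENT)[::-1]


# Mutagenic primer pairs as listed in Extended Data Table 2.
primer_pairs = {
    "F1.D250A": ("CCTCGACTTCGCCGCCGTGAAGACCGTCATGG",
                 "CCATGACGGTCTTCACGGCGGCGAAGTCGAGG"),
    "C1.E218A": ("CGTCGTCGACTCCTCGGCGATTATCAACACGC",
                 "GCGTGTTGATAATCGCCGAGGAGTCGACGACG"),
    "A206K": ("CCTGTCCACACAATCTAAGCTTTCGAAAGATCCC",
              "GGGATCTTTCGAAAGCTTAGATTGTGTGGACAGG"),
}

for name, (forward, reverse) in primer_pairs.items():
    is_pair = reverse_complement(forward) == reverse
    print(f"{name}: forward and reverse primers fully complementary = {is_pair}")
```

A full-length match in both directions is what overlap-extension mutagenesis relies on, because the two first-round PCR products must anneal across the mutated site.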
Tetanus toxoid and CCL3 improve dendritic cell
vaccines in mice and glioblastoma patients


Duane A. Mitchell*, Kristen A. Batich*, Michael D. Gunn, Min-Nung Huang, Luis Sanchez-Perez, Smita K. Nair,
Kendra L. Congdon, Elizabeth A. Reap, Gary E. Archer, Annick Desjardins, Allan H. Friedman, Henry S. Friedman,
James E. Herndon II, April Coan, Roger E. McLendon, David A. Reardon, James J. Vredenburgh, Darell D. Bigner
& John H. Sampson


After stimulation, dendritic cells (DCs) mature and migrate to drain- 
ing lymph nodes to induce immune responses’. As such, autolog- 
ous DCs generated ex vivo have been pulsed with tumour antigens 
and injected back into patients as immunotherapy. While DC vac- 
cines have shown limited promise in the treatment of patients with 
advanced cancers” * including glioblastoma* ’, the factors dictating 
DC vaccine efficacy remain poorly understood. Here we show that 
pre-conditioning the vaccine site with a potent recall antigen such as 
tetanus/diphtheria (Td) toxoid can significantly improve the lymph 
node homing and efficacy of tumour-antigen-specific DCs. To assess 
the effect of vaccine site pre-conditioning in humans, we randomized 
patients with glioblastoma to pre-conditioning with either mature 
DCs° or Td unilaterally before bilateral vaccination with DCs pulsed 
with Cytomegalovirus phosphoprotein 65 (pp65) RNA. We and other 
laboratories have shown that pp65 is expressed in more than 90% of 
glioblastoma specimens but not in surrounding normal brain’, 
providing an unparalleled opportunity to subvert this viral protein 
as a tumour-specific target. Patients given Td had enhanced DC 
migration bilaterally and significantly improved survival. In mice, 
Td pre-conditioning also enhanced bilateral DC migration and sup- 
pressed tumour growth in a manner dependent on the chemokine 
CCL3. Our clinical studies and corroborating investigations in mice 
suggest that pre-conditioning with a potent recall antigen may rep- 
resent a viable strategy to improve anti-tumour immunotherapy. 
To evaluate the influence of vaccine site pre-conditioning on DC migra- 
tion clinically, we conducted a randomized and blinded clinical trial in 
newly diagnosed glioblastoma (GBM) (Extended Data Fig. 1). A total
of 13 patients consented to this trial but only 12 were randomized as 1 
progressed before randomization (Extended Data Table 1). Patients were 
randomized to unilateral vaccine site pre-conditioning with unpulsed, 
autologous DCs° or Td, on the basis of our hypothesis that it would 
induce inflammation at the vaccine site’. The accumulation of injected 
DCs in vaccine site-draining lymph nodes (VDLNs) was significantly 
greater in patients given Td (Fig. 1a). Moreover, Td-treated patients also 
showed a significant increase in both progression-free survival (Fig. 1b) 
and overall survival (Fig. 1c) compared to DC-treated patients. From the 
time of diagnosis, patients in the DC cohort had median progression- 
free and overall survivals of 10.8 and 18.5 months, respectively. Thus, 
the median progression-free and overall survivals for the DC cohort 
were consistent with patients treated with the standard of care™*. Three 
censored patients from the Td cohort did not progress and were alive 
at the time of survival analysis (>36.6 months). Overall, these prognos- 
tic factors varied across both treatment groups as expected in a small 
clinical trial. However, there was no discernible trend across prognostic 
factors that would suggest that these factors alone account for the observed 
differences in survival between cohorts. Using both a previously pub- 
lished recursive partition analysis’* and the European Organization for 
Research and Treatment of Cancer (EORTC) nomogram”* for predict- 
ing outcome of patients with GBM, Td-treated patients exceeded expected 
survival times by a far greater degree than did DC-treated patients in both 
cases by nearly the same amount (Extended Data Table 1). The vaccine 
responses in long-term survivors varied in durability, but pp65-specific 
Figure 1 | Td pre-conditioning increases DC migration to VDLNs and is
associated with improved clinical outcomes. a, DC migration in Td (n = 6)
versus unpulsed DC (n = 6) patients (two sample t-test, P = 0.049).
Mean ± s.e.m.; n values represent biological replicates of patient bilateral
inguinal lymph nodes (iLNs). b, c, Patient progression-free survival (b) and
overall survival (c) (log-rank test, P = 0.013). d, Hazard ratios (HRs): DC
migration efficiency from Td and DC cohorts showing the effect of a 1-unit
increase in the percentage migration on progression-free (left) and overall
(right) survival (Cox proportional hazards model, progression-free survival
HR = 0.845, P = 0.027; overall survival HR = 0.820, P = 0.023). In b and
c, n = 3 censored Td patients (no progressive disease at survival analysis).
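
The hazard ratios in d are quoted per 1-unit increase in percentage migration. Under the Cox proportional hazards assumption the log hazard is linear in the covariate, so the ratio for a larger increase is obtained by compounding. The sketch below illustrates that arithmetic; the function name and the 5-unit example are assumptions chosen for illustration, not an analysis reported in the paper.

```python
import math


def hazard_ratio_for_increase(hr_per_unit: float, delta: float) -> float:
    """Hazard ratio implied by a `delta`-unit covariate increase.

    In a Cox model, hazard = baseline * exp(beta * x), with
    beta = ln(HR per unit), so the HR for `delta` units is exp(beta * delta).
    """
    if hr_per_unit <= 0:
        raise ValueError("hazard ratio must be positive")
    beta = math.log(hr_per_unit)
    return math.exp(beta * delta)


# Per-unit hazard ratios reported in Fig. 1d.
reported = {"progression-free survival": 0.845, "overall survival": 0.820}

for outcome, hr in reported.items():
    # Hypothetical 5-percentage-point increase in DC migration.
    print(f"{outcome}: HR for +5 units = {hazard_ratio_for_increase(hr, 5):.3f}")
```

Such an extrapolation assumes the linear, proportional-hazards form holds across the range of migration values, which a cohort of this size cannot confirm.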


¹Preston Robert Tisch Brain Tumor Center, Duke University Medical Center, Durham, North Carolina 27710, USA. ²Division of Neurosurgery, Department of Surgery, Duke University Medical Center,
Durham, North Carolina 27710, USA. ³Department of Pathology, Duke University Medical Center, Durham, North Carolina 27710, USA. ⁴Division of Cardiology, Department of Medicine, Duke University
Medical Center, Durham, North Carolina 27710, USA. ⁵Department of Immunology, Duke University Medical Center, Durham, North Carolina 27710, USA. ⁶Division of Surgical Sciences, Department of
Surgery, Duke University Medical Center, Durham, North Carolina 27710, USA. ⁷Department of Biostatistics and Bioinformatics, Duke University Medical Center, Durham, North Carolina 27710, USA.
⁸Department of Radiation Oncology, Duke University Medical Center, Durham, North Carolina 27710, USA.


*These authors contributed equally to this work. 


366 | NATURE | VOL 519 | 19 MARCH 2015 


©2015 Macmillan Publishers Limited. All rights reserved 


immune responses were detectable for several months in all long-term 
survivors. An increase in pp65-specific interferon-γ spot-forming units
from baseline did correlate with overall survival, and the two long-term 
survivors for which samples were available had the highest increases 
in pp65-specific immune responses after vaccination. In addition, we 
observed a notable association between DC migration to the VDLNs 
and progression-free and overall survival (Fig. 1d) in patients with GBM 
receiving pp65 RNA-pulsed DC vaccines. 

To validate these clinical results and understand the mechanistic 
underpinnings, we performed analogous studies in a mouse model. 
Vaccine sites of Td-immune mice were pre-conditioned with Td and 
then received a bilateral vaccine of ovalbumin (OVA) RNA-pulsed DCs. 
In parallel to our clinical findings, Td-immune mice receiving Td pre- 
conditioning had a threefold increase in DCs within the afferent inguinal 
lymph nodes (Fig. 2a). This effect was attributable to Td-specific recall 
responses as mice not primed with Td (Td-naive mice) did not display 
any increased DC migration to VDLNs (Fig. 2b). Vaccination and pre- 
conditioning with other CD4-dependent protein antigens also increased 
DC migration, suggesting that this may be a generalizable phenomenon
(Extended Data Fig. 2). Subsequent studies performed in Td-treated
mice demonstrated that only selective depletion of CD4+ T cells abrogated
the increase in DC migration (Fig. 2c). The effect of enhanced
migration was also transferable to naive mice administered Td-activated
CD4+ T cells (Fig. 2d).

In patients with GBM randomized to unilateral Td pre-conditioning 
(Fig. 2e) and in mice (Fig. 2f), we observed an increased uptake of 
¹¹¹In-labelled DCs in bilateral lymph nodes, suggesting that Td pre-conditioning
increased DC migration through systemic mediators. Subsequent
experiments revealed that Td-activated CD4+ T cells administered
systemically in naive mice were also sufficient to increase bilateral
DC migration (Extended Data Fig. 3). Vaccine site pre-conditioning with
unpulsed DCs or TNF-α (ref. 8) only increased DC migration ipsilaterally
(Extended Data Fig. 4a, b).

On the basis of our observations that Td recall responses could induce 
bilateral DC migration and that systemic administration of Td-activated 
CD4+ T cells was sufficient to recapitulate the increased DC migration,
we sought to examine the induction of CD4-dependent inflammatory 
mediators in the serum of patients and mice after a recall response with 
Td pre-conditioning. CCL3 was the only chemokine or cytokine to be 
increased in both patients and mice and had the greatest fold increase 
of all chemokines in the serum of both after Td pre-conditioning (Fig. 3a 
and Extended Data Fig. 5a–d).

To identify the site of the CCL3 production, we assayed the pre- 
conditioning sites in mice and found high concentrations of this che- 
mokine only unilaterally at the site of Td pre-conditioning (Extended 
Data Fig. 6a). Subsequent experiments showed that CCL3 upregulation 
in the skin was dependent on the induction of the Td recall response 
(Extended Data Fig. 6b) and was significantly reduced by CD4+ T-cell
depletion (Extended Data Fig. 6c). Induction of CCL3 by Td pre- 
conditioning remained increased over time compared to mice lacking 
Td recall responses (Extended Data Fig. 6d). 

To evaluate the importance of CCL3 in mediating DC trafficking
to VDLNs in vivo, we immunized and boosted Ccl3−/− mice with Td
as described above and found that the migration of injected DCs to
VDLNs in Ccl3−/− mice was significantly lower than in wild-type Td
pre-conditioned mice (Fig. 3b, top). Endogenous migration experiments
revealed that resident Langerhans cells also did not migrate as efficiently
in Ccl3−/− hosts after Td pre-conditioning (Extended Data Fig. 7).
To address the possibility that Td priming may have failed in Ccl3−/−
mice owing to some earlier role of CCL3, we compared the CD4 T-cell-dependent
immune responses to Td in both wild-type and Ccl3−/−
hosts and found no differences in the ability of these two strains to
mount anti-tetanus immune responses after Td priming and boosting
(Extended Data Fig. 8).

Our studies demonstrated that CD4+ T cells activated by the Td recall
response were sufficient to induce increased DC migration in naive mice
(Fig. 2d). However, Td-activated CD4+ T cells could not rescue the
limited DC migration in Ccl3−/− hosts, indicating that activated T cells
were necessary, but not sufficient, to increase DC migration (Fig. 3b,
bottom). Exogenous administration of CCL3 could rescue the limited
Figure 2 | Td recall response activates CD4+ T cells to increase DC
migration to VDLNs. a, Control inguinal (n = 5 mice) versus Td inguinal
(n = 5 mice); two sample t-test, P = 0.0001; Td popliteal (n = 5 mice) versus Td
inguinal, paired t-test, P = 0.014. b, Mice primed and boosted with saline
(primary Td, n = 6) or Td (control and recall Td, n = 6) with Td (primary and
recall Td) or saline (control) pre-conditioning; one-way analysis of variance
(ANOVA), P = 0.004; post-hoc Tukey t-test, control versus recall Td,
P = 0.006; primary Td versus recall Td, P = 0.011. c, DC migration in depleted
Td-immunized mice (n = 5); one-way ANOVA, P < 0.0001; post-hoc Tukey
t-test, Td versus CD4, P = 0.005; Td versus CD8, CD19 or NK1.1, P > 0.05.
d, DC migration after CD4+ transfer (n = 4 mice); one-way ANOVA,
P < 0.0001; post-hoc Tukey t-test, Td-activated CD4+ T cells (CD4act) versus
CD4naive, P < 0.05; control versus CD4naive, P > 0.05; Td versus CD4act,
P > 0.05. e, f, Patient (e) and mouse (f) iLN ipsilateral (n = 6) and contralateral
(n = 6) to pre-conditioning; paired t-test, P = 0.28 (e) and P = 0.37 (f). a–d and
f are representative of four experiments; mean ± s.e.m. Data in a–d denote
biological replicates of individual right and left iLN or lymph nodes ipsilateral
to Td/saline for popliteal and GFP groups.
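
Several panels in this caption report one-way ANOVA across treatment groups. As a minimal sketch of what that test computes, the function below returns the F statistic and its degrees of freedom from raw group values. The migration percentages in the example are hypothetical numbers invented for illustration, not data from this study, and obtaining a P value would additionally require the F distribution, which is outside this sketch.

```python
from statistics import mean


def one_way_anova_f(groups):
    """Return (F, df_between, df_within) for a one-way ANOVA.

    `groups` is a list of lists of observations, one inner list per group.
    """
    k = len(groups)
    n_total = sum(len(g) for g in groups)
    if k < 2 or n_total <= k:
        raise ValueError("need at least two groups and more observations than groups")
    grand_mean = mean(x for g in groups for x in g)
    # Variation of the group means around the grand mean.
    ss_between = sum(len(g) * (mean(g) - grand_mean) ** 2 for g in groups)
    # Variation of observations around their own group mean.
    ss_within = sum((x - mean(g)) ** 2 for g in groups for x in g)
    df_between, df_within = k - 1, n_total - k
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within


# Hypothetical DC migration percentages for three groups (illustration only).
control = [1.0, 2.0, 3.0]
primary_td = [2.0, 3.0, 4.0]
recall_td = [5.0, 6.0, 7.0]

f_stat, df_b, df_w = one_way_anova_f([control, primary_td, recall_td])
print(f"F({df_b}, {df_w}) = {f_stat:.2f}")
```

A significant F only indicates that at least one group mean differs; the pairwise post-hoc Tukey comparisons quoted in the caption identify which groups differ.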




[Figure 3 image, panels a–c. Recoverable labels: legend, patients and mice (a); y axis, DC migration (%) (b, c); groups, WT and Ccl3−/−; treatments, Td prime (i.m.), Td pre-conditioning (i.d.) and CCL3 i.v.]


Figure 3 | Td recall responses and induced CCL3 cooperate to facilitate DC
migration to VDLNs. a, Serum CCL3 fold increase over non-Td cohorts
(patient, n = 6; mouse, n = 8 biological replicates); signed-rank test, P = 0.031
and P = 0.039. b, Top, DC migration in wild-type (WT) (n = 5) versus Ccl3−/−
(n = 4) mice; two sample t-test, P = 0.023. Bottom, Td-activated CD4+
transfer in Ccl3−/− hosts (n = 4 mice); two sample t-test, P = 0.029. c, CCL3
and Td recall responses rescue migration (n = 4 mice); one-way ANOVA,
P < 0.0001; post-hoc Tukey t-test, Ccl3−/− Td plus Td i.d. and CCL3 i.v. versus
Ccl3−/− Td plus Td i.d. and versus Ccl3−/− plus CCL3 i.v., P = 0.007 and
P = 0.001, respectively. Data in b and c denote biological replicates of
individual right and left iLNs. Representative of three experiments;
mean ± s.e.m. i.d., intradermal; i.m., intramuscular; i.v., intravenous.


DC migration in Ccl3−/− mice, but only when the Td recall responses
were induced (Fig. 3c), indicating that the ability of Td pre-conditioning
to increase DC migration to VDLNs was dependent on both CD4+
recall responses and host-derived CCL3.

Alterations in CCL21 expression along the lymphatic endothelium
in the skin take place in the context of inflammation and have been
associated with increased DC migration. After Td pre-conditioning,
higher levels of CCL21 in collected skin sites were detected in Td-treated
wild-type mice compared to Ccl3−/− mice, and exogenous administration
of CCL3 rescued the diminished levels of CCL21 at the vaccine sites
of Ccl3−/− mice only in the context of Td recall responses (Extended
Data Fig. 9a), which paralleled the requirement of host CCL3 and Td
recall responses for increased DC migration. Furthermore, Td pre-conditioning
resulted in an increase in CCL21 within inguinal lymph
nodes both ipsilateral and contralateral to the side of Td pre-conditioning
(Extended Data Fig. 9b). Although lymph nodes contain high basal
amounts of CCL21, we found CCL21 levels in bilateral inguinal lymph
nodes of Td-treated Ccl3−/− mice to be lower than those of wild-type mice.
Meanwhile, reconstitution of CCL3 back into Ccl3−/− mice in conjunction
with the induction of Td recall responses markedly increased
the expression of CCL21 in VDLNs (Extended Data Fig. 9c).

The apparent increase in progression-free and overall survival for
Td-treated patients prompted us to determine whether Td pre-conditioning
could inhibit tumour growth in a mouse subcutaneous tumour model
in which B16F10 melanoma cells expressed OVA (B16-OVA). Pairwise
comparisons revealed that tumour growth in Td plus OVA-DC mice
was significantly delayed (Fig. 4a) in an antigen-dependent manner
(Fig. 4b). Td pre-conditioning in Ccl3−/− mice, as expected, was unable
to inhibit tumour growth (Fig. 4c). Similarly, plt (paucity of lymph node
T cell) mice, which lack expression of CCL21 in the lymph nodes,
Figure 4 | Td pre-conditioning improves responses in tumour-bearing
mice. a, Inset, transformed growth curves, mixed linear effects model. Pairwise
comparisons of regression line slopes (F-test, P < 0.0001). Day 22 volume
(Td plus OVA-DC versus Td plus GFP-DC, two sample t-test, P = 0.002;
n = 7). b, Antigen-specific responses with Td pre-conditioning. Day 15 volume
(all groups, F-test, P = 0.016; pairwise Tukey t-tests, Td plus OVA-DC and
B16-OVA (n = 7) versus Td plus OVA-DC and B16 (n = 5), P = 0.0004; Td
plus OVA-DC and B16-OVA versus saline plus OVA-DC and B16-OVA
(n = 6), P = 0.0002). Day 22 volume (Td plus OVA-DC and B16-OVA versus
Td plus OVA-DC and B16, two sample t-test, P < 0.0001). c, Tumour
growth in Ccl3−/− mice. Day 11 volume (all groups, F-test, P = 0.005). Day 27
volume (Td plus OVA-DC WT versus Td plus OVA-DC Ccl3−/−, two
sample t-test, P = 0.042; n = 8). d, Anti-tumour responses in plt mice. Day 16
volume (all groups, F-test, P = 0.004). Day 24 volume (Td plus OVA-DC plt
(n = 7) versus Td plus OVA-DC WT (n = 6), two sample t-test, P < 0.05).
Data in a–d are representative of three experiments; mean ± s.e.m. ALT,
autologous lymphocyte transfer.


also failed to suppress tumour growth (Fig. 4d), indicating that Td- 
enhancement of OVA-DC vaccines also required intact CCL21 expres- 
sion in the host. 

Results from our trial seem to demonstrate that the modulation of 
CMV-specific DCs with Td pre-conditioning increases their migratory 
capacity and may improve clinical outcomes in patients with GBM. Cor- 
roborating studies in mice support these claims and underline CCL3 as 
a novel and important mediator of increased DC migration to VDLNs, 
in addition to its described roles in DC precursor mobilization to peripheral
sites of inflammation and in guiding naive CD8+ localization
to sites of DC–CD4+ interactions in lymph nodes. Our preclinical
findings suggest that increased DC migration was only observed in the 
context of host CCL3 and Td recall responses. Similarly, we observed a 
slight increase in CCL21 skin and bilateral VDLN levels in the context 
of both host CCL3 and Td recall, which may have a role in DC homing
to local lymph nodes. However, owing to the already high basal amount 
of CCL21 chemokine in draining lymph nodes, it is possible that additional
factors such as interactions between host CCL3 and CD4+ recall
T cells are the predominant contributors to the increased DC accumu- 
lation in VDLNs. Our findings also suggest that DC migration should 
be further investigated as a predictive biomarker for immunotherapy 
studies. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
METHODS


Patient selection, demographics and clinical protocol. The clinical protocol and 
informed consent were approved by the US Food and Drug Administration and 
Institutional Review Board at Duke University. Adults with a newly diagnosed 
World Health Organization (WHO) grade IV GBM, who had a gross total resec- 
tion and residual radiographic contrast enhancement on post-resection magnetic 
resonance imaging (MRI) not exceeding 1 cm in diameter in two perpendicular 
axial planes, and a Karnofsky performance scale score of ≥80, were eligible for the
clinical study (FDA - IND-BB-12839, Duke IRB Pro00003877, NCT00639639). 
Histopathology of all specimens was initially read as GBM, but this diagnosis was 
re-confirmed by a second board-certified neuropathologist. Histological diagnosis 
included immunohistochemistry for MGMT protein expression. Benign endothe- 
lial cells staining positive for MGMT served as the internal control’. MGMT pro- 
moter methylation was performed by PCR. On the basis of published reports showing 
high expression of CMV viral proteins in >90% of GBM tumours””’, we elected 
not to include pp65 staining of tumour tissue as an eligibility criterion for this trial. 
All 13 patients on study received a gross total resection defined as >90% with residual
contrast enhancement of <1 cm², and steroid doses could not exceed 2 mg day⁻¹
of dexamethasone. No patients received intensity-modulated radiation therapy or 
had 5-aminolevulinic acid dye used during resection. Thereafter, all patients com- 
pleted a 6-week course of conformal external beam radiotherapy to a dose of 60 Gy 
with concurrent temozolomide at a targeted daily dose of 75 mg m⁻² day⁻¹. After
completion of standard therapy, all patients underwent an MRI for evidence of 
progressive disease. Those with evidence of progressive disease or who required steroid
therapy in excess of physiological levels at the time of vaccination were replaced. A 
total of 13 patients were enrolled and randomized before the first cycle of
standard-of-care 5-day TMZ (200 mg m⁻² day⁻¹), but one progressed before
randomization. For each vaccine, 2 × 10⁷ mature pp65 RNA-pulsed DCs in 0.4 ml of saline
were given intradermally in the groin. The first vaccination occurred on day 21 ± 2
of TMZ cycle 1. Although some patients (n = 5) were also randomized to receive 
an autologous lymphocyte transfer, those patients did not show a significant improve- 
ment in progression-free or overall survival. Patients given autologous lymphocytes 
were additionally administered 3 × 10⁷ cells kg⁻¹ intravenously with acetaminophen
(650 mg per os (po)) and Benadryl (25–50 mg po) given 30–60 min before
infusion. The first three DC vaccines were given bi-weekly, and, at vaccine 4,
patients were randomized to Td or unpulsed autologous DCs and received
¹¹¹In-labelled DCs for migration studies. Vaccine 4 and additional monthly vaccines until
tumour progression occurred on day 21 ± 2 of successive TMZ cycles. A minimum
of six cycles of adjuvant TMZ were required as per standard-of-care and continua- 
tion was at the discretion of the treating neurooncologist. Patients were monitored 
for treatment-related toxicity, and none of the patients experienced any vaccine or 
Td-related adverse events. 

Human autologous DC generation for vaccination and production of pp65- 
LAMP/A64 mRNA. DCs were generated using the method described previously”, 
and after collection the cells were frozen and assessed for contamination and lin- 
eage purity as previously published”. The 1.932-kilobase (kb) pp65 full-length 
cDNA insert was obtained from B. Britt and RNA was generated and transfected 
as previously reported”. 

Human DC migration studies. DC migration studies were done at the fourth 
vaccination. Patients were randomized by side to have one inguinal vaccination site 
pre-treated with either 1 × 10⁶ unpulsed DCs or Td toxoid (1 flocculation unit (Lf)).
Saline was administered on the contralateral side. Vaccination site pretreatment
was done 6–24 h before DC vaccination. DCs were labelled with ¹¹¹In (GE Healthcare)
at 10 μCi per 1 × 10⁷ DCs and divided equally in the two sites. Gamma camera
images (GE Infinia Hawkeye) were taken immediately after injection and at 24
and 48 h after injection to compare ¹¹¹In-labelled DC migration from the inguinal
injection sites to the inguinal lymph nodes. 

Progression-free and overall survival. The more recent response evaluation cri- 
teria in solid tumours (RECIST criteria) judge progression by measuring the longest 
one-dimensional diameter and determine progression by a 20% increase in this 
diameter**. Once progression is detected on MRI, other imaging modalities such 
as positron emission tomography and a stereotactic brain biopsy of the enhancing
region are incorporated to aid in determining progression. A stereotactic brain biopsy 
or resection demonstrating recurrence defines clinical progression. Progression-free 
survival was defined as the time until radiographic or clinical progression and was 
censored at the last follow-up if the patient remained alive without disease pro- 
gression. Overall survival was defined as the time until death and was censored at 
the last follow-up if the patient remained alive at the time of analysis. Progression- 
free and overall survival for all patients were calculated from both the time of sur- 
gery and from randomization to vaccine site pre-conditioning. 
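
As an illustration only, the censoring rules described above can be sketched in Python. The record type `FollowUp` and its field names are hypothetical and do not come from the study; times are assumed to be in months from a single origin (surgery or randomization). Each function returns a (time, event observed) pair, which is the form of input a Kaplan-Meier estimator expects.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class FollowUp:
    """Hypothetical per-patient record; all times in months from one origin."""
    last_follow_up: float
    progression: Optional[float] = None  # radiographic or clinical progression
    death: Optional[float] = None        # time of death, if it occurred


def progression_free_survival(p: FollowUp) -> Tuple[float, bool]:
    """Return (time, event_observed); censor at last follow-up if no progression."""
    if p.progression is not None:
        return p.progression, True
    return p.last_follow_up, False


def overall_survival(p: FollowUp) -> Tuple[float, bool]:
    """Return (time, event_observed); censor at last follow-up if still alive."""
    if p.death is not None:
        return p.death, True
    return p.last_follow_up, False
```

A patient who is alive and progression-free at the last follow-up contributes a censored observation to both end points.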

Mice. All animal experiments were performed according to Duke University Insti- 
tutional Animal Care and Use Committee-approved protocols. Female C57BL/6 
wild-type, OT-I transgenic mice, Ccl3−/−, and red fluorescent protein (RFP) and
green fluorescent protein (GFP) transgenic mice (ubiquitin promoter) were obtained
from the Jackson Laboratory and were bred under pathogen-free conditions at
Duke University Medical Center. The plt strain was provided by M.D.G. and main-
tained at Duke University Medical Center. All mice were bred under pathogen-free
conditions at Duke University Medical Center. 

Generation of mouse bone marrow-derived DCs, electroporation and pheno- 
typing. Bone-marrow-derived DCs were generated from 6-8-week-old female 
C57BL/6 wild-type, RFP+ or GFP+ transgenic mice and pulsed with OVA RNA as
previously described”*. For phenotyping, anti-mouse phycoerythrin (PE)-conjugated
CD11c (HL3), CD80 (16-10A1), CD86 (GL1), Ly-6G (1A8), MHC class II (I-Ab;
AF6-120.1) and isotype controls (IgG1; G235-2356, IgG2a,κ; R35-95) were from
BD Pharmingen. Cells were washed, resuspended in PBS and 2% FBS, incubated at 
4°C for 30 min, and washed again before use. 

Vaccine site pre-conditioning and DC vaccination in mice. For Td immunization, 
female 6-8-week-old C57BL/6 mice received a primary intramuscular vaccine of 
Td toxoid (Sanofi Aventis; Decavac; 1 Lf, 100 µl) administered bilaterally into the
quadriceps muscle (50 µl per leg). An intramuscular booster (0.5 Lf, 50 µl) was
administered two weeks later. Vaccine site pre-conditioning with saline or Td toxoid
(0.5 Lf) was given intradermally (i.d.) 2 weeks after the booster and randomized to 
the right or left groin site. Mouse IgG antibody responses to Td were measured by 
ELISA (Xpress Bio). Serum from immunized mice was collected 2 weeks after each 
immunization before the next booster vaccine. DCs were resuspended at 1 × 10°
per 100 µl PBS (Gibco) and administered i.d. on both sides 0.8 cm from the groin
crease 24 h after i.d. pre-conditioning. DCs injected in the groin ipsilateral to the
Td pre-conditioning side were directly injected i.d. within the erythematous nodule 
produced by Td pre-conditioning. For recall response experiments using other 
protein antigen formulations, female 6-8-week-old C57BL/6 mice received a primary 
intramuscular vaccine of Prevnar 13 (Pfizer, Pneumococcal 13-valent conjugate
vaccine, 1.32 µg, 100 µl) and Pedvax HIB (Merck, Haemophilus b conjugate vaccine,
1.5 µg, 100 µl) administered bilaterally into the quadriceps muscle (50 µl per
leg). Vaccine site pre-conditioning with saline or the protein antigen (50 µl) was
given i.d. 2 weeks later and randomized to the right or left groin site. DC vaccines 
were given 24 h later, and migration to lymph nodes was assessed 48 h later. As with 
Td pre-conditioning, DCs injected in the groin ipsilateral to the pre-conditioning 
side were directly injected i.d. within the erythematous nodule produced by those 
formulations. For comparisons of other pre-conditioning agents, female 6-8-week-
old C57BL/6 mice received a unilateral dose of unpulsed, mature DCs (1 × 10° in
50 µl) or TNF-α (30 ng) administered i.d. at the groin site 24 h before DC vaccina-
tion. On the basis of the previous work using these pre-conditioning regimens, DC
migration to bilateral inguinal lymph nodes was assessed 24 h later. For all other
migration experiments, popliteal and inguinal lymph nodes were collected 48 h after 
DC vaccination and digested for flow cytometry. The percentage of migrating DCs 
was enumerated by gating on fluorescent DCs in wild-type VDLNs. DCs from
wild-type (GFP and RFP) mice were used as negative controls before gating on fluorescent
DCs within VDLNs to account for background autofluorescent cells that may have
appeared in the GFP channel. For in vivo DC migration, a sample size (three per
group) was based on empirical evidence from previously published reports as the
size necessary for adequate statistical analysis of lymph nodes sampled”*.

Depletion, adoptive transfer and CCL3 reconstitution. Female 6-8-week-old
C57BL/6 mice were initially depleted of cellular subsets once daily (200 µg per mouse
intraperitoneally) for 3 days before the first Td intramuscular immunization. Anti- 
mouse CD4 (GK1.5) and anti-CD8 (2.43) antibodies were purchased from American 
Type Culture Collection. Anti-mouse NK1.1 (PK136) and anti-CD19 (2D5) and 
control isotype depleting antibodies (IgG2a (2A3) and IgG2b (LTF-2)) were from 
BioXCell. Maintenance doses of depletion antibodies were administered at 3-day 
intervals (200 µg intraperitoneally) until vaccine site pre-conditioning with Td
2 weeks later. For adoptive transfer experiments, Td-activated CD4+ T cells (CD4act)
were induced in donor female 6-8-week-old C57BL/6 mice. Mice were primed (1 Lf,
100 µl) and boosted (0.5 Lf, 50 µl) intramuscularly with Td 2 weeks apart. Three days
after the i.d. Td pre-conditioning, donor inguinal lymph nodes, skin injection sites,
and spleens were collected and processed for negative isolation of CD4+ T cells
(Miltenyi Biotec). Complementary sites from naive mice were collected simulta-
neously and processed for negative isolation of CD4+ T cells (CD4naive). A final
dose of 6 × 10° CD4+ T cells was administered intravenously into recipient mice
two days before i.d. vaccination with RFP+ DCs. For CCL3 reconstitution in
Ccl3−/− hosts, recombinant mouse CCL3 (R&D Systems) was administered intra-
venously into the tail vein (10 µg per mouse) 12 h before vaccination with RFP+
DCs. Ccl3−/− mice that were Td-immune were given recombinant CCL3 12 h after
Td pre-conditioning at the vaccine site. 

Tumour implantation experiments. For tumour implantation experiments, B16F10- 
OVA cells were grown as previously published” and injected subcutaneously at a 
concentration of 2 × 10° cells in 200 µl of PBS in the flank of C57BL/6 mice 8 days
before vaccine site pre-conditioning, the first intradermal vaccine of OVA RNA-pulsed
DCs, and autologous lymphocyte transfer (1:1 infusion of naive:OT-I OVA-specific
T cells). Randomization of mice occurred after tumour inoculation before vaccine 
site pre-conditioning and the first DC vaccine first by compilation and then by 
random sorting into various treatment cages. Mice received two additional weekly 
vaccines of RNA-pulsed DCs on days 15 and 22. Ten days after tumour implanta- 
tion, flank sites were monitored daily for tumour growth, and tumour size was 
measured every 2 days. Tumour volume (millimetres cubed) was calculated by the 
formula (length × width² × 0.52) in a perpendicular fashion. Mice were euthanized
when ulceration occurred or when the tumour reached either 2 cm in any direction
or 2,000 mm³. Analysis of tumour growth focused on follow-up assessments before
considerable dropout occurred. A logarithmic transformation yielded a linear rela- 
tionship between tumour volume and time for all curves. A mixed effects linear 
model that accounted for correlation of measurements within a mouse was used to 
examine the relationship between time and log [tumour volume + 1]. No blinding 
was done for these animal studies. 
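
A minimal Python sketch of the volume calculation and size end points described above, assuming the formula reads length × width² × 0.52 with both diameters in millimetres. The function names are illustrative, the base of the logarithm is not stated in the text (the natural logarithm is assumed here), and "reached" is interpreted as greater than or equal to the threshold.

```python
import math

ELLIPSOID_FACTOR = 0.52  # constant taken from the formula in the text


def tumour_volume_mm3(length_mm: float, width_mm: float) -> float:
    """Volume from two perpendicular diameters: length x width^2 x 0.52."""
    return length_mm * width_mm ** 2 * ELLIPSOID_FACTOR


def log_volume(volume_mm3: float) -> float:
    """log[tumour volume + 1]; natural log is an assumption."""
    return math.log(volume_mm3 + 1.0)


def reached_size_end_point(length_mm: float, width_mm: float) -> bool:
    """True at 2 cm in either measured direction or at 2,000 mm^3."""
    too_long = max(length_mm, width_mm) >= 20.0
    too_large = tumour_volume_mm3(length_mm, width_mm) >= 2000.0
    return too_long or too_large
```

Adding 1 before taking the logarithm keeps the transform defined on days when no tumour is yet measurable (volume 0).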

Mouse tumour cell lines. The B16F10-OVA tumour cell line was a gift from R. 
Vile*’**. The B16F10 cell line was provided by I. Fidler’’. Cell lines were tested for 
mycoplasma before use. 

Mouse lymph node digestion and quantification of fluorescent and endogen- 
ous DCs. Lymph nodes were placed in 6-well culture plates containing 1 ml HBSS 
with Ca²⁺/Mg²⁺ (Gibco), digested for 35 min at 37 °C with collagenase A (1 mg ml⁻¹;
Roche) and DNase I (0.2 mg ml⁻¹; Sigma-Aldrich), and 20 mM EDTA (Invitrogen)
was added for 5 min at room temperature to stop the reaction”*. Single-cell suspen-
sions were prepared, cells were centrifuged (500g for 5 min) and resuspended in PBS
with 2% FBS and stained with mouse allophycocyanin (APC)-conjugated CD11c
(BD Pharmingen; HL3). For quantification of RFP+ or GFP+ counts in individual
lymph nodes, samples were resuspended at an equal volume and 50 µl of counting
beads (Invitrogen; 50,000 beads) were added to each sample. Cells were gated first
on mouse CD11c+ cells and then RFP+ or GFP+ cells, and absolute cell counts/
lymph nodes were quantified using the following equation: RFP+ or GFP+ events
× 50,000 beads/number of bead events. For endogenous DC migration experiments,
cells were surface stained in PBS with 3% FBS, 10 mM EDTA, 5% normal mouse 
serum, 5% normal rat serum and 1% Fc Block (eBioscience; clone 93) and then 
intracellularly stained with anti-CD207 according to the manufacturer’s protocol 
(BD Cytofix/Cytoperm Kit). The cells were analysed by BD LSRII flow cytometer 
with FlowJo software (Tree Star). FITC-conjugated anti-B220 (RA-3-6B2), Alexa 
Fluor 700-conjugated anti-Ly-6G (1A8), APC-Cy7-conjugated anti-CD11b (M1/70), 
V450-conjugated anti-Ly-6C (AL-21) are from BD Pharmingen. PE-conjugated 
anti-CD207 (eBioL31), PE-Cy5.5-conjugated anti-CD11c (N418), PE-Cy7-conjugated
anti-CD8 (53.6.7), APC-conjugated anti-CD103 (2E7), eFluor 605NC-conjugated anti-
CD45 (30-F11) and eFluor 650NC-conjugated anti-MHC class II (I-A/I-E) (M5/
114.15.2) are from eBioscience. FITC-conjugated anti-CD3 (145-2C11) and anti- 
CD49b (DX5) are from BioLegend. The LIVE/DEAD Fixable Aqua Dead Cell Stain 
Kit is from Molecular Probes. 
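
The bead-based equation above can be written as a short Python function. This is a sketch only; the function and parameter names are illustrative and are not part of any cytometry software.

```python
def absolute_count(fluorescent_events: int,
                   bead_events: int,
                   beads_added: int = 50_000) -> float:
    """Absolute RFP+ or GFP+ cells per lymph node sample.

    Implements: fluorescent events x beads added / bead events,
    with 50,000 counting beads added per sample as stated in the text.
    """
    if bead_events <= 0:
        raise ValueError("at least one bead event is needed to scale the count")
    return fluorescent_events * beads_added / bead_events
```

The ratio of beads added to beads acquired estimates the fraction of the sample that was actually run, so dividing by it scales the acquired fluorescent events up to the whole sample.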

Serum cytokine and chemokine analysis. Peripheral blood was collected 24 h after 
vaccine site pre-conditioning before DC vaccination. For patients, blood was collected 
in 10 ml venous collection tubes (BD), allowed to clot, spun at 1,170g for 15 min, 
and serum was stored at −190 °C. For mice, blood was collected in microtainer
tubes (BD), allowed to clot for 30 min, spun at 8,000g for 5 min, and serum was
stored at −80 °C. Multiplex cytokine and chemokine kits were used for patient and
mouse studies (cytokines and chemokines of interest for human, Affymetrix and 
Millipore: EPX080-10007-901, EPX010-12121-901, EPX010-12125-9, EPX010- 
10287-901, HCYTOMAG-60K-01 MDC; for mouse: Affymetrix and Millipore: 
EPX090-20821-901 ProcartaPlex 9 plex, MCYP3MAG-74K-01 MDC) following 
the manufacturer’s instructions. 

Expression of chemokines CCL3 and CCL21 in mice. Female 6-8-week-old 
C57BL/6 or Ccl3−/− mice were immunized with Td as described above. Twenty-four
hours after Td pre-conditioning, both left and right skin sites and inguinal lymph
nodes were collected. For protein isolation, skin and lymph node samples were 
placed in pre-loaded bead lysis Eppendorf tubes (Next Advance) containing RIPA 
buffer (Sigma) with protease inhibitor cocktail tablets (Mini Complete Protease 
Inhibitor Cocktail Tablets, Roche Applied Science). Homogenization was performed 
with the Bullet Blender at 4 °C. Supernatants were collected by centrifugation, and 
chemokines were quantified by ELISA. Quantikine kits (R&D Systems) were used 
for CCL3, and RayBiotech ELISA kits were used for CCL21. Corresponding sam- 
ples were run for total protein concentration using the Bradford assay. CCL3 and 
CCL21 concentrations were normalized across samples and expressed as picograms 
or nanograms per milligram of total skin or lymph node protein. 
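
The normalization to total protein can be sketched as follows, assuming the ELISA reading (pg ml⁻¹) and the Bradford reading (mg ml⁻¹) are taken from the same supernatant. The function names are illustrative only.

```python
def normalize_to_total_protein(chemokine_pg_per_ml: float,
                               total_protein_mg_per_ml: float) -> float:
    """Return chemokine content in pg per mg of total protein."""
    if total_protein_mg_per_ml <= 0:
        raise ValueError("total protein concentration must be positive")
    return chemokine_pg_per_ml / total_protein_mg_per_ml


def pg_to_ng(value_pg: float) -> float:
    """Convert picograms to nanograms (1 ng = 1,000 pg)."""
    return value_pg / 1000.0
```

Dividing by total protein corrects for differences in the amount of tissue recovered from each skin or lymph node sample.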

Statistical analysis. Statistics were reviewed by biostatisticians and tested as described 
in figure legends. Cox proportional hazard models were used to evaluate DC migra- 
tion and clinical outcomes. The log-rank test was used to compare Kaplan-Meier 
survival curves with censored patient data. For in vivo DC migration and VDLN 
studies, individual patient and mouse inguinal lymph nodes were treated as sepa- 
rate biological replicates, based on the underlying assumption that unilateral vaccine 
site pre-conditioning may preferentially affect local draining lymph nodes over non- 
draining sites. An unpaired two-sample Student’s t-test was used for two-group 
comparisons. Paired t-tests were used for comparisons between lymph nodes in 
the same host. One-way ANOVA was used to assess differences among three or 
more groups with post-hoc Tukey t-tests for two-group comparisons. Wilcoxon 
rank sum analyses were conducted for pairwise comparisons in serum cytokine/ 
chemokine panels. Signed rank tests were used to evaluate fold increase in chemo- 
kine levels. For tumour growth curves, a mixed effects linear model was employed 
using log-transformed curves and F-test for pairwise comparisons of regression 
line slopes and mean tumour volumes on the first day of detectable tumours (y- 
intercept). Repeated measures for calculation of slopes incorporated time between 
detectable tumour until considerable dropout occurred (maximal tumour size, ulcer-
ation, or death). Mean tumour volumes at final time points when the entire con-
trol cohort expired were compared between two groups using an unpaired two-
sample Student’s t-test. Asterisks indicate level of significance (*P < 0.05, **P =
0.01, ***P < 0.001, P > 0.05 not significant). No statistical methods were used to
predetermine sample size. 

Extended Data Figure 1 | Schema of clinical trial. SPECT/CT, single photon emission computed tomography/computed tomography; TMZ, temozolomide;
XRT, external beam radiotherapy. 

Extended Data Figure 2 | Recall responses induced by other CD4+ T-cell-
dependent protein antigens increase DC migration to VDLNs. Primary
immunization and vaccine site pre-conditioning with CD4+ T-cell-dependent
protein antigens increase DC migration to VDLNs. Mice were immunized
with either Haemophilus b conjugate (Hib) or pneumococcal 13-valent
conjugate (PCV) intramuscularly, and 2 weeks later received vaccine site
pre-conditioning with the recall antigen (recall) or saline (control). A separate
cohort of mice received saline only throughout the immunization schedule
(saline). Scatter plot shows biological replicates of individually processed right
and left iLN per mouse (4 mice per group). Percentage migration of RFP+
DCs to VDLNs; one-way ANOVA, P < 0.0001; post-hoc Tukey t-test, PCV
control versus recall, P < 0.05, Hib control versus recall, P < 0.05.
Representative of n = 3 experiments; mean ± s.e.m.

Extended Data Figure 3 | Bilateral migration of OVA-DCs after Td pre-
conditioning or Td-activated CD4+ T-cell transfer. Uptake of injected DCs
to right and left iLNs 48 h after DC vaccination in Td-immune mice receiving
Td pre-conditioning or naive mice administered Td-activated CD4+ T cells.
Scatter plot shows biological replicates of individually processed right and left
iLN per mouse (5 mice per group). CD4, ipsilateral versus contralateral,
paired t-test, P = 0.41. Representative of n = 4 experiments; mean ± s.e.m.

Extended Data Figure 4 | Unilateral pre-conditioning with unpulsed DCs
or TNF-α results in increased DC homing to ipsilateral draining inguinal
lymph nodes. Td-immune mice pre-conditioned with Td or saline before
administration of OVA RNA-pulsed DC vaccine. Separate cohorts of naive
mice received either 1 × 10° unpulsed DCs or 30 ng TNF-α on one side of the
groin 24 h before the bilateral RFP+ DC vaccine. DC migration was quantified
24 h after vaccination. a, DC migration to ipsilateral lymph nodes (one-way
ANOVA, P = 0.0018; post-hoc Tukey t-test, saline i.d. versus Td i.d., P = 0.007;
saline i.d. versus TNF-α, P < 0.05; saline i.d. versus DCs, P < 0.05; Td i.d.
versus TNF-α, P = 0.042; DCs versus Td i.d. and DCs versus TNF-α, P > 0.05.
b, DC migration to contralateral lymph nodes; one-way ANOVA, P = 0.003;
post-hoc Tukey t-test, saline i.d. versus DCs or TNF-α, P > 0.05; Td i.d. versus
TNF-α, DCs or saline i.d., P < 0.05. n values are biological replicates of
individually processed right and left iLNs per mouse (4 mice per group).
Representative of n = 3 experiments; mean ± s.e.m.

Extended Data Figure 5 | Serum cytokine and chemokine profile after
Td pre-conditioning in patients and mice. a, Serum cytokine panel of patients
following vaccine site pre-conditioning with Td or unpulsed DCs; Wilcoxon
rank sum, IFN-γ and IL-4, P < 0.05 (n = 6 patients). b, Similar to a in
mice; Wilcoxon rank sum, all comparisons, P > 0.05 (Td recall n = 5, non-Td
n = 6). c, Patient serum chemokines after vaccine site pre-conditioning. Patient
CCL2 and CCL3 in Td recall (Td, n = 6) versus non-Td (unpulsed DC, n = 5);
one-way ANOVA and Wilcoxon rank sum, P < 0.05. d, Mouse CCL22, CCL7
and CCL3 in Td recall (Td, n = 8 mice) and non-Td (saline, n = 8 mice);
one-way ANOVA and Wilcoxon rank sum, P < 0.05. For a-d, individual
values represent biological replicates; mean ± s.e.m.

Extended Data Figure 6 | Td vaccine site pre-conditioning results in CCL3
upregulation in Td-immune hosts. a, CCL3 production in skin site after Td
pre-conditioning (Td ipsilateral versus contralateral). Representative of four
independent experiments. b, CCL3 production in skin after Td recall response.
c, CCL3 induction at skin site is abrogated with previous host depletion of
CD4+ T cells. Bars in a-c represent CCL3 protein detected in skin sites from
n = 2 mice with n = 2 technical replicates performed per mouse. d, CCL3
remains increased at the Td pre-conditioning site in the skin after DC
vaccination (24, 48 and 72 h, one-way ANOVA, P = 0.0001, Td plus Td i.d. and
OVA-DC versus Td plus saline i.d. and OVA-DC, and saline plus saline i.d. and
OVA-DC, P < 0.05, post-hoc Tukey t-test). Individual values represent
biological replicates from n = 4 mice; mean ± s.e.m.

Extended Data Figure 7 | Migratory DC subsets in wild-type and Ccl3−/−
mice after induction of Td recall responses. Both wild-type and Ccl3−/−
mice were first immunized with Td and then challenged with Td pre-
conditioning. Migration of endogenous DC subsets to inguinal lymph nodes
contralateral to the site of Td pre-conditioning was assessed at days 4 and 8
after Td administration. a, Gating strategy used to quantify DC subsets in
inguinal lymph nodes after skin pre-conditioning with Td. LC, Langerhans
cells; MoDC, monocyte-derived DCs. b, Day-8 migration of LC population to
non-draining inguinal lymph nodes in Ccl3−/− hosts is reduced in absence of
CCL3; two-sample t-test, P = 0.046. Representative of three experiments.
Individual values represent biological replicates from n = 4 mice;
mean ± s.e.m.

Extended Data Figure 8 | Anti-tetanus toxoid memory responses are
induced and maintained in wild-type and Ccl3−/− mice throughout Td
priming and boosting. Wild-type and Ccl3−/− mice primed and boosted
with Td. Serum from immunized mice was collected 2 weeks after each
immunization before the next booster vaccine (for each boosting phase,
wild-type versus Ccl3−/−, two-sample t-test, P > 0.05). i.m., intramuscular.
Scatter plot showing averaged values from n = 4 mice with n = 2 technical
replicates performed per mouse. Representative of three experiments;
mean ± s.e.m.

Extended Data Figure 9 | CCL21 levels in Td pre-conditioning skin sites
and draining lymph nodes of wild-type and Ccl3−/− mice. a, CCL21 levels in
skin site of Ccl3−/− hosts after induction of Td recall response and CCL3
administration. Mixed model accounting for within-mouse correlation of
measurements, F-test, P < 0.001; pairwise comparisons, WT Td plus Td i.d.
(n = 4) versus Ccl3−/− Td plus Td i.d. (n = 3), P = 0.049; Ccl3−/− Td plus Td
i.d. and CCL3 i.v. (n = 4) versus Ccl3−/− Td plus Td i.d. and versus Ccl3−/−
plus CCL3 i.v. (n = 4), P = 0.044 and P = 0.0045, respectively. Scatter plot
shows averaged values with n = 2 technical replicates performed per mouse.
b, Bilateral iLN CCL21 levels in wild-type mice after Td recall with skin site
pre-conditioning. Bars represent CCL21 protein within iLN ipsilateral and
contralateral to the side of Td pre-conditioning from n = 2 mice with n = 2
technical replicates performed per iLN. Representative of three experiments.
c, Increased lymph node CCL21 in Ccl3−/− hosts after CCL3 reconstitution
and induction of Td recall response (all two group comparisons). CCL21 levels
in bilateral iLNs of Ccl3−/− hosts after induction of Td recall response and
CCL3 administration. Mixed model accounting for within-mouse
correlation of measurements, F-test, P < 0.001; pairwise comparisons, WT Td
plus Td i.d. (n = 4 iLNs) versus Ccl3−/− Td plus Td i.d. (n = 3 iLNs), P = 0.045;
Ccl3−/− Td plus Td i.d. and CCL3 i.v. (n = 4 iLNs) versus Ccl3−/− Td plus
Td i.d. and versus Ccl3−/− plus CCL3 i.v. (n = 4 iLNs), P = 0.0066 and
P = 0.026, respectively. Scatter plot shows averaged values with n = 2 technical
replicates performed per lymph node sampled. In a and c, mean ± s.e.m.

Extended Data Table 1 | Clinical trial patient characteristics

Patient  Sex  Age   Biopsy location      Race  KPS  WHO  MMSE score  IDH1  MGMT  MGMT promoter methylation  Vaccine site pre-conditioning  PFS (randomization)  OS (randomization)  PFS (diagnosis)  OS (diagnosis)  EORTC predicted median OS  EORTC O−E  RPA class  RPA O−E
1        F    46    R. frontal           W     100  0    29          -     N/A   N/A                        Td                             8.8                  19.2                15.4             25.7            26.8*                      −1.1       III        7.8
2        F    32    R. frontal           W     100  0    29          +     -     -                          DC                             21.7                 35.2                27.8             41.3            13.09                      28.31      III        23.4
3        M    62    R. temporal          W     90   1    29          -     -     -                          DC                             4.9                  10.9                11.2             17.3            9.8                        7.5        IV         6.2
4        M    71    L. frontoparietal    W     90   1    30          -     -     +                          Td                             41.4                 41.4                47.3†            47.3‡           16.4                       30.9       IV         36.2
5        M    43    L. temporal          W     90   1    28          -     -     +                          DC                             2.5                  12.2                10.0             19.7            16.4                       3.3        III        1.8
6        F    59    R. temporal          W     80   1    30          -     -     -                          Td                             39.5                 39.5                45.0†            45.0‡           9.8                        35.2       IV         33.9
7        M    75    R. temporal          B     90   1    26          -     +     +                          Td                             9.5                  14.7                15.4             20.6            11.8                       8.8        IV         9.5
8        F    58    L. parietal          H     90   1    22          -     +     N/A                        DC                             2.5                  13.4                9.1              20.0            13.1*                      6.9        IV         8.9
9        F    30    R. temporal          W     90   1    29          N/A   +     N/A                        Td                             36.6                 36.6                44.1†            44.1‡           21.3*                      22.8       III        26.2
10       F    66    L. frontoparietal    W     80   1    24          -     +     -                          DC                             3.9                  9.5                 10.4             16.0            6.9                        9.1        IV         4.9
11       F    28    L. frontotemporal    W     100  0    28          -     -     -                          DC                             6.4                  7.4                 12.5             13.8            13.0                       0.8        III        −4.1
12       M    59    R. posteroparietal   W     90   1    29          §     §     §                          DC                             §                    §                   5.4              19.0            18.1*                      0.9        IV         7.9
13       M    71    L. posterotemporal   W     90   1    21          -     +     +                          Td                             12.7                 15.2                18.5             20.9            11.8                       9.1        IV         9.8
Median (DC)   58.5                             90   1    28                                                                                4.4                  11.6                10.8             18.5            13.09                      6.99       IV         6.2
Median (Td)   65                               90   1    29                                                                                NE                   NE                  NE               NE              14.1                       >16        IV         >18


Demographic and prognostic factors of newly diagnosed GBM patients with vaccine site pre-conditioning randomization strategy and corresponding progression-free and overall survival from the time of surgery
and from randomization to pre-conditioning. Observed and predicted survival times are expressed in months. Median values are shown for Td and unpulsed DC cohorts. Predicted median overall survival for RPA
class yielded 17.9 months for class III and 11.1 months for class IV based on recursive partition analysis?®. Model 3 of the EORTC scoring system that incorporated MGMT promoter methylation status, MMSE score,
and WHO performance status was used to generate predicted median survival rates.

* Model 2 of EORTC scoring system if methylation status unavailable.

† No progression.

‡ Alive.

§ Patient progressed before time of randomization for vaccine site pre-conditioning.

EORTC, European Organization for Research and Treatment of Cancer; L, left; IDH1, isocitrate dehydrogenase type 1; KPS, Karnofsky performance status; MGMT, O6-methylguanine-DNA methyltransferase;
MMSE, mini-mental state examination; NA, tissue not available; NE, not estimable; O−E, observed − expected survival months; OS, overall survival; PFS, progression-free survival; R, right; RPA, recursive
partitioning analysis; WHO, World Health Organization.

A human tRNA synthetase is a potent
PARP1-activating effector target for resveratrol 


Mathew Sajish’ & Paul Schimmel!” 


Resveratrol is reported to extend lifespan’? and provide cardio- 
neuro-protective’, anti-diabetic’, and anti-cancer effects** by initi- 
ating a stress response” that induces survival genes. Because human 
tyrosyl transfer-RNA (tRNA) synthetase (TyrRS) translocates to the 
nucleus under stress conditions®, we considered the possibility that 
the tyrosine-like phenolic ring of resveratrol might fit into the active 
site pocket to effect a nuclear role. Here we present a 2.1 Å co-crystal
structure of resveratrol bound to the active site of TyrRS. Resveratrol
nullifies the catalytic activity and redirects TyrRS to a nuclear func-
tion, stimulating NAD+-dependent auto-poly-ADP-ribosylation of
poly(ADP-ribose) polymerase 1 (PARP1). Downstream activation
of key stress signalling pathways is causally connected to TyrRS-
PARP1-NAD+ collaboration. This collaboration is also demonstrated
in the mouse, and is specifically blocked in vivo by a resveratrol- 
displacing tyrosyl adenylate analogue. In contrast to functionally 
diverse tRNA synthetase catalytic nulls created by alternative splic-
ing events that ablate active sites’, here a non-spliced TyrRS cata-
lytic null reveals a new PARP1- and NAD+-dependent dimension
to the physiological mechanism of resveratrol. 

Human TyrRS is a homodimer of a 528 amino-acid polypeptide that
harbours an appended eukaryote-specific carboxy (C)-terminal EMAP-II 
domain (Fig. 1a)*’. High-resolution structures of the catalytic unit of 
Hs TyrRS, known as mini-TyrRS, and the C-domain have been deter- 
mined'*"’. We found resveratrol (RSV) strongly inhibited TyrRS with 
an inhibition constant (Ki) value of 22 µM (Extended Data Fig. 1a-c).
Crystallization of Hs mini-TyrRS with RSV and, separately, with tyr-
osine yielded co-crystal structures (at 2.1 Å) (Fig. 1b, Extended Data
Fig. 2a, b and Extended Data Table 1; Protein Data Bank (PDB) acces- 
sion numbers 4Q93 and 4QBT). 

While the phenolic ring of RSV and of the tyrosine have the same 
disposition in the respective co-crystals, accommodation of the cis- 
conformation of the dihydroxy ring of RSV forces a local structural 
change near the linker to the C-domain (Fig. 1b and Extended Data 
Fig. 2a). An RSV-promoted conformational change in TyrRS may drive 
the predominant trans RSV (in solution) into a cis conformation (Ex-
tended Data Fig. 2c, d).

Associated with a previous study’, a distinct TyrRS-PARP1 inter-
action was observed. PARP1 is a major modulator of NAD+ metabolism
and its related signalling’*. Because RSV acts through NAD+-dependent
proteins”, the TyrRS-PARP1 interaction was further studied. Given 
that RSV treatment elicits a stress response’, serum starvation was used 
to mimic a general ‘stand-alone’ stress condition so that common sig- 
nalling pathways, if any, between RSV treatment and a general stress 
condition, could be compared ex vivo. Either serum starvation or RSV 
treatment promoted nuclear translocation of endogenous TyrRS in HeLa 
cells (Fig. 1c). Translocation was observed under different stress con- 
ditions (heat shock and endoplasmic reticulum stress; Extended Data 
Fig. 3a), suggesting that TyrRS is a general stress transducer. Nuclear 
translocation of endogenous TyrRS was concomitant with strong auto-
PARylation of PARP1 (PARP1^PAR) (Fig. 1c and Extended Data Fig. 3a).

Ex vivo RSV also strongly promoted association of TyrRS with PARP1, 
and robust auto-poly-ADP-ribosylation of PARP1 (Extended Data 
Fig. 3b). Effects of RSV were blocked by a Tyr-AMP analogue (Tyr-
SA (5'-O-[N-(L-tyrosyl)sulphamoyl]adenosine)), but not by Gly-SA
(a control targeting GlyRS) (Extended Data Fig. 3b, c). Similar, but less 
pronounced, PARylation was seen with serum starvation. Enhanced 
PARylation correlated with increased amounts of TyrRS in the nucleus, 
which occurred upon serum starvation. Thus, both serum starvation 
and RSV promoted nuclear translocation of TyrRS and activation of 
PARP1.

Cell lysates treated with the PARG hydrolase and its hydrolase-
inactive mutant supported the idea that TyrRS preferentially bound to
non-PARylated PARP1 (Extended Data Fig. 3d, e). TyrRS interacted 
specifically with the C-domain of PARP1 (CT-PARP1) (Extended Data 
Fig. 3f). Neither mini-TyrRS nor the TyrRS C-domain interacted with 
PARP1; only full-length native TyrRS bound PARP1 (Extended Data
Fig. 3g, h). In the absence of RSV, concentration-dependent activation 
of PARP1 by TyrRS was observed in vitro (Fig. 2a, top, and Extended 


TyrRS. a, Cartoon illustration of the domain 
organization of human TyrRS. Both domains 

are connected by a linker of approximately 20 
amino acids. b, Left, Electron density of co-crystal 
X-ray structures (2.1 Å) of TyrRS bound to
cis-resveratrol and to L-tyrosine. Right, resveratrol
induced a local conformational change relative to
bound tyrosine at the active site. c, Both serum
starvation and resveratrol treatment (5 µM)
facilitated the nuclear translocation of TyrRS with a 
concomitant increase in the PARylation of PARP1. 
Data Fig. 4a, b). RSV enhanced in vitro auto-PARylation with the half-
maximal effect at roughly 10 nM (Fig. 2a, middle), well below the Ki
(about 22 µM) in Extended Data Fig. 1a-c. Thus, PARP1 may alter the
apparent affinity of RSV for TyrRS. Also, concentration-dependent 
quenching of PARylation of PARP1 by Tyr-SA was evident (Fig. 2a, 
bottom). Lastly, while broken DNA normally activates PARP1 (ref. 13), 
Tyr-SA did not interfere with this DNA-dependent-pathway of PARP1 
activation in vitro (Extended Data Fig. 4c). Therefore, TyrRS-RSV ac- 
tivation of PARP1 is distinct. 

Ectopically expressed TyrRS in HeLa cells for 0-24 h caused progres- 
sive increase in cellular concentrations of the synthetase (Fig. 2b, top) 
and a correlated progressive increase in PARP1^PAR (Fig. 2b, top). A
TyrRS mutant (TyrRS-dNLS), with reduced nuclear localization’, re- 
duced activation of PARP1 (Extended Data Fig. 4d). Effects of RSV ex 
vivo at various concentrations (Fig. 2b, middle) and of Tyr-SA (Fig. 2b, 
bottom) mirrored those seen in vitro (Fig. 2a, middle and bottom). 

RSV and L-tyrosine produce different bound conformations of TyrRS, 
seen locally in the high-resolution structures of the two co-crystals 
(Fig. 1b and Extended Data Fig. 2a). These differences could affect the 
disposition of the C-domain needed for the interaction of TyrRS with 
PARP1. This domain is tethered to the mini-TyrRS catalytic unit by a 
flexible linker (Fig. 2c, left). By breaking the Y341 OH-H bond to a 
backbone carbonyl oxygen, a Y341A mutation releases tight tethering 

Figure 2 | TyrRS facilitates the activation of PARP1 in an active-site-
dependent manner. a, Top: TyrRS activates PARP1 in an in vitro assay.
Middle: resveratrol potentiates TyrRS-mediated activation of PARP1. Bottom: 
Tyr-SA blocks the resveratrol-mediated activation of PARP1. b, Top: TyrRS- 
V5 overexpression activates PARP1 in HeLa cells in a concentration-dependent 
manner. Middle: resveratrol treatment activates PARP1 in HeLa cells and 
enhances TyrRS interaction with PARP1. Ctrl, control. Bottom: Tyr-SA blocks 
the resveratrol-mediated interaction of TyrRS and activation of PARP1. 

c, Cartoon illustration of the C-domain disposition in TyrRS (left) and 
Y341A-TyrRS (right). WT, wild type. d, Y341A-TyrRS enhances its interaction 
and activates PARP1 compared with wild type. IP, immunoprecipitation; 
IB, immunoblot. 
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of the C-domain to the catalytic domain”* and its reorientation”® (Fig. 2c, 
right). 

In the absence of RSV, Y341A-TyrRS showed robust interaction with 
and activation of PARP1 in vitro (Fig. 2d). If Y341A is shifted more 
towards a conformation needed to bind to PARP1, then it would be 
more sensitive than wild-type TyrRS to low concentrations of RSV. This 
expectation was fulfilled (Extended Data Fig. 4e). TyrRS-Tyr may thus 
have a conformation that prevents interaction with PARP1, while TyrRS- 
RSV has a distinct conformation that binds PARP1. In the absence of 
either ligand, a dynamic equilibrium populates both forms (Fig. 2c). 

A working model of downstream signalling markers was developed 
as detailed in Methods (Fig. 3a). The effects of serum starvation and RSV 
administration involve cascades of protein acetylation and phosphorylation 
that have not previously been linked to TyrRS or to activation of PARP1. 
Under conditions of 
serum starvation or RSV administration, the response in HeLa cells of 
PARylated and acetylated proteins, together with the aforementioned 
cell-signalling proteins, was monitored for up to 1h. Either RSV or 
serum-starvation promoted production of AcK-Tip60 and AcK-ATM 
(Fig. 3b). In a temporal response from 0 to 8 h, either or both of serum 
starvation or 1 μM RSV promoted increased levels of whole-cell PARylated 
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Figure 3 | Resveratrol and serum starvation mimic similar downstream 
signalling events mediated by PARP1 activation. a, Cartoon illustrating the 
molecular basis and the integration of different signalling pathways to mediate 
a TyrRS/PARP1-activated stress response evoked by either resveratrol or 
different stress conditions. b, PARP1-activating conditions enhance Tip60-mediated 
activation of ATM. c, Time course (minutes and hours) of poly-ADP-ribosylation 
status and associated signalling events as depicted in Fig. 3a 
after serum starvation and d resveratrol (1 μM) treatment. 
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proteins, Hsp72, AcK382-p53, p-AMPK, pSer36-H2B, p21, FOXO3A, 
14-3-3, HO-1, NAMPT, SIRT1, and SIRT6 (Fig. 3c, d). Also, RSV ac- 
tivated acetylation of Tip60 ex vivo in a concentration-dependent man- 
ner (Extended Data Fig. 5a). As expected from previous reports that 
acetylation of p53 is indispensable for its activation”, a transient increase 
in acetylation of p53 (AcK382)' was evident (Fig. 3c, d), possibly through 
a transient inhibition of SIRT1 by the nicotinamide’ being produced. 
Consistently, at 15 min, an RSV concentration-dependent reduction in 
total NAD+ with concomitant production of nicotinamide and ADP-
ribose was observed (Extended Data Fig. 5b, c). Thus, serum starvation 
and RSV rapidly increased all monitored proteins in a way that is de- 
pendent on having active PARylation. This rapid increase is consistent 
with work showing rapid upregulation of PARP1 target genes’””°, pos- 
sibly because of a broader effect of PARP1 on transcription and assoc- 
iated stress signalling’*”’. 

At longer times, and continuing at low concentrations (≤10 μM) of 
RSV, a similarity between the responses to RSV and to serum starvation 
was apparent. By 1 h, low concentrations of RSV elevated NAD+ 
levels to values similar to those seen under serum starvation (Extended Data 
Fig. 5d). Responses of key stress-signalling markers over a more extended 
period of 8-24 h at 5 μM RSV were mostly similar to those of serum 
starvation (Extended Data Fig. 5e, f). At 5 μM RSV, levels of PARylated PARP1 (PARP1^PAR) 
were sustained longer than at 1 μM RSV 
(compare the 2 h and 8 h time points in Fig. 3d with Extended Data Fig. 5f), 
and we thus continued further studies at 5 μM RSV. 

A short interfering RNA (siRNA) directed against PARP1 
(siRNA^PARP1) effectively abrogated the RSV-stimulated expression of 
Hsp72, p-AMPK, SIRT1, FOXO3A, SESN2, NAMPT, PUMA, and SIRT6 
(Fig. 4a). Also, 5 μM RSV promoted induction of BRCA1 and p14ARF 
(whose genes are directly regulated by PARP1 (refs 22, 23)). Consistently, 
the RSV-stimulated inductions of BRCA1 and p14ARF were prevented 
by siRNA^PARP1 (Fig. 4a), further showing involvement of PARP1 
in RSV-mediated induction of these proteins. 

In Fig. 4a, RSV treatment led to enhanced activation of p53. How- 
ever, in the absence of RSV, knockdown of PARP1 resulted in a back- 
ground increase in the products of these p53-regulated genes. Possibly 
p53 was activated by the knockdown of PARP1, because removal of 
PARP1 would enhance background levels of DNA damage. Indeed, activation 
of p53, as seen by enhanced acetylation and phosphorylation of 
p53, was observed (Fig. 4a). 

siRNA^TyrRS knockdown at 5 μM RSV eliminated induction of, and 
dramatically reduced amounts of, PARylated PARP1, of whole acetylated 
proteins, and of AcK382-p53, AcK16-H4 (Tip60 activation), p-AMPK, 
and pSer36-H2B (AMPK activation) (Fig. 4b). (Separately, RSV (5 μM) 
addition did not affect the viability of HeLa cells expressing either 
siRNA^TyrRS or siRNA^PARP1 (Extended Data Fig. 6).) In contrast, 
siRNA^SIRT1 knockdown did not affect RSV (5 μM)-mediated production 
of whole acetylated proteins or, among other downstream markers, 
activation of PARP1 and induction of SIRT6, FOXO3A, NAMPT, 
AcK16-H4, p-AMPK, and pSer36-H2B (Extended Data Fig. 7). Collectively, 
Fig. 4a, b shows that ex vivo TyrRS and PARP1 collaborate to 
activate RSV- and serum-starvation responses. 

Although the RSV-TyrRS-PARP1 axis initially consumes NAD+, at 
low RSV (5 μM), NAD+ levels are transiently raised after 1 h owing to 
NAMPT activation (Extended Data Fig. 5d). At 1 h at low RSV, induction 
of NAD+ levels was abolished with either siRNA^TyrRS or siRNA^PARP1 
(Extended Data Fig. 8). NAD+ depletion by NAMPT inhibition abolished 
RSV-mediated induction of BRCA1, FOXO3A, NAMPT, SESN2 
and SIRT6 (Fig. 4c). These results are consistent with those using inhi- 
bitors against PLC-y and the ryanodine receptor”, which inhibit PARP1 
activation’® and ADP-ribose-mediated calcium signalling”, and which 
prevented RSV-mediated AMPK activation. Similarly, a PARP1 inhib- 
itor prevented LKB1 activation by RSV”. 

Mice were injected intravenously through the tail vein with 100 μl of 
10 μM RSV and, after 30 min, were euthanized. Compared with the PBS 
intravenous-injected control, PARylated and acetylated proteins, along 
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Figure 4 | Resveratrol treatment activates TyrRS-PARP1-driven signalling 
events in mouse tissues. a, b, siRNA of PARP1 (a) or siRNA of TyrRS 
(b) abrogates resveratrol-mediated downstream signalling events. HeLa cells 
were treated with siRNA^PARP1 or siRNA^TyrRS for 60 h to knock down PARP1 or 
TyrRS expression levels by approximately 70-80%. Knockdown efficiency 
was monitored with α-PARP1 or α-TyrRS. For siRNA^TyrRS, HeLa cells were 
treated with either serum starvation or resveratrol (5, 10, 25, and 50 μM) 
for 45 min. The dashed line in Fig. 4a represents the demarcation between 
siRNA^Ctrl and siRNA^PARP1; all the samples were run on the same gel. 

c, NAMPT inhibition abrogates resveratrol-mediated downstream signalling 
events. HeLa cells were pre-treated with STF-118804 (100 nM), a NAMPT 
inhibitor, for 16 h and then treated with resveratrol (5 μM), with samples 
collected at intervals as indicated. Various signalling events, as depicted, were 
monitored using appropriate antibodies. Acetylation status was monitored 
using α-acetyl-lysine. The dashed line separates results without and with 
the inhibitor. d, Resveratrol-mediated activation of PARP1 is blocked by 
Tyr-SA. Activations of Tip60 and AMPK were monitored by using α-AcK16-H4 
and α-pSer36-H2B, respectively. e, Resveratrol-mediated interaction 
of TyrRS with PARP1 and acetylation of p53 are blocked by Tyr-SA. 
Immunoprecipitation of PARP1 and p53 from muscle tissue demonstrated 
RSV-mediated TyrRS-PARP1 interaction and p53 acetylation. 


with AcK16-H4 and pSer36-H2B, were significantly increased in skeletal 
muscle (Extended Data Fig. 9a, b). Results in cardiac tissue were similar 
(Extended Data Fig. 9c, d). In addition, PARP1 activation diminished 
back to normal by 24 h (Extended Data Fig. 9e). Consistent with ex vivo 
assays, tissue samples from RSV-injected mice showed higher levels of 
TyrRS-PARP1 interaction together with increased auto-PARylation 
(Extended Data Fig. 9f). 

Tyr-SA (5 μM) was added to the intravenous injection with RSV. No 
interaction of TyrRS with PARP1 could be detected in skeletal or car- 
diac muscle and levels of PARylated and acetylated proteins did not 
increase. In contrast, co-injection of Gly-SA or cycloheximide (CHX, 
protein synthesis inhibitor) did not block RSV/TyrRS-mediated acti- 
vation of PARP1 and its downstream signals (Fig. 4d (muscle) and 
Extended Data Fig. 9g, h (cardiac)). Consistent with ex vivo assays, im- 
munoprecipitation of PARP1 (from harvested muscle tissue) pulled 
down TyrRS, and immunoprecipitation of p53 showed its increased 
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acetylation. Pull-down of TyrRS and p53 acetylation were blocked in 
mice that had co-injections of Tyr-SA or the PARP1 inhibitor AG14361 
(SelleckChem), but not of Gly-SA or cycloheximide (Fig. 4e). These 
responses in vivo parallel ex vivo assays, and support the idea that TyrRS 
is a major effector target for RSV that acts through PARP1 as described 
in Fig. 3a. 

The mechanisms of action of RSV and of the stress response are both 
linked here to the activation of PARP1 through TyrRS and NAD+. The 
interaction of RSV with TyrRS could be viewed as an example of 
xenohormesis through interactions of a natural ligand with a protein 
target”’. In this instance, the natural ligand blocks the active site to cre- 
ate a tRNA synthetase catalytic null with a new, orthogonal function. 
This kind of catalytic null is in contradistinction to those created by al- 
ternative splicing events that specifically remove the active site’. The 
RSV-TyrRS activation of PARP1 is readily observable in a functional 
in vitro assay even at sub-micromolar concentrations (for example, 10 nM; 
Fig. 2a, middle). Thus, TyrRS-RSV-induced PARP1 activation appears 
at significantly lower RSV concentrations than seen with RSV functional 
binding to other targets'”*”°. As a consequence, the direct effects 
of RSV binding to these targets are layered over a pre-existing 
foundation that comes from the TyrRS-RSV-PARP1-NAD+ connection. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 

Construction of the working model. Because both RSV and serum starvation activate 
PARP1, leading to the metabolic breakdown of NAD+ into nicotinamide and 
ADP-ribose, downstream signalling markers associated with the NAD+ metabolic 
flux were investigated. Targets were selected if they were activated and/or inhibited 
under conditions of RSV treatment, stress, or NAD+ metabolic by-product treatment 
(nicotinamide and ADP-ribose). For example, nicotinamide activates Tip60 
(ref. 31) through SIRT1 inhibition’* and Tip60 activates ATM”, the p53 kinase. 
Protein p53 is a master regulator of NRF2 through p21 induction® and inducible 
expression of survival genes such as SIRT6 (ref. 34), SIRT1 (ref. 35), sestrins*’, and 
FOXO3A”. (SIRT6 promotes DNA repair under stress by activating PARP1 (ref. 38) 
and SIRT1 downregulates the over-activation of PARP1 (ref. 39).) Similarly, ADP-
ribose (metabolic by-product of NAD+ and an inhibitor of SIRT1”°) is a potent 
Ca2+ channel opener’, a substrate for AMP production”, and hence an indispensable 
component for AMPK activation by resveratrol’ and the stress response 
by phosphorylating H2B“*. AMPK is known to upregulate NAMPT expression”, 
the major regulator of NAD+ levels in the cell’. NAD+ is a major substrate for sirtuins 
(deacetylation or mono-ADP-ribosylation***”“*) and PARPs ((mono/poly)-ADP-
ribosylation’*“’). Although not depicted, p38 and NF-κB (major transducers of stress 
signalling) are also regulated by TAK1 in a PARP1/ATM-dependent manner**". 
Antibodies. All antibodies were obtained from Cell Signaling Technology unless 
otherwise mentioned, with the following exceptions: α-ATM was from Abcam and GeneTex; 
α-PARP1 and α-poly(ADP-ribose) were from BD Biosciences; α-p84 was from 
Sigma; α-pSer36-H2B was from ECM Biosciences; rabbit α-human TyrRS polyclonal 
antibodies were homemade; α-Tip60 was from Millipore; α-AcK16-H4 was from 
Active Motif; and α-Hsp72 (catalogue number SPA-812) was from Assay Designs. 
Cell culture, transfection, and resveratrol treatment. HeLa cells (from ATCC, 
mycoplasma free) were cultured in a humidified incubator with 5% CO2 in DMEM 
medium (Invitrogen) supplemented with 10% FBS (Invitrogen) and 1× penicillin/
streptomycin. The cells were transfected with pcDNA6-TyrRS-V5 (wild type), or 
pcDNA3.1-ZZ-PARP1 using Lipofectamine LTX (Invitrogen). Resveratrol (Santa 
Cruz Biotechnology), at a series of concentrations from 0 to 50 μM, was used to 
treat HeLa cells for 0-24 h in the experiments described. As the effects of RSV are strongly 
influenced by the background level of PARP1 activity, the experiments were done 
only after verifying that background PARP1 activation was low. Our criterion 
was to see PARP1 activation upon serum starvation (DMEM medium without 
FBS), along with Hsp72 induction”. When serum starvation induced PARP1 activation 
with Hsp72 induction within 30-45 min (from the same batch of a split of 
HeLa cells), we consistently saw PARP1 activation at lower concentrations of RSV 
(1 μM). Cell culture medium was supplemented with Tyr-SA or Gly-SA (RNA-TEC) 
where mentioned. Silencer Select Pre-Designed siRNA against PARP1 (siRNA s1099, 
5'-GCAGCUUCAUAACCGAAGAtt-3’), SIRT1 (siRNA s23770, 5'-GGCUUG 
AUGGUAAUCAGUAtt-3’) and TyrRS (siRNA s443, 5’-GGACUUUGCUGCU 
GAGGUUtt-3’) were purchased from Invitrogen and transfected into HeLa cells 
using Lipofectamine RNAiMAX (Invitrogen). NAMPT, the rate-limiting enzyme 
in the NAD+ salvage pathway, was inhibited using STF-118804* according to the 
manufacturer’s protocol (SelleckChem). The effect of a concomitant increase in 
nicotinamide production with a reduction of NAD+ concentration (leading to SIRT1 
inhibition) was monitored by immunoblotting with α-AcK382-p53, an acetylation 
site known to be specifically targeted by SIRT1 (ref. 1). Monitoring of α-pSer15-p53 
indicated whether enhanced acetylation preceded the phosphorylation event™. Total 
p53 was blotted with α-p53. Transactivation of p53 was determined by monitoring 
its known targets, such as p21, PUMA, 14-3-3, FOXO3A, SESN2, and SIRT6. Activation 
of NRF2 was further confirmed by following HO-1 expression, using the cognate 
antibodies. Activation of AMPK (Thr172 phosphorylation) mediated through 
AMP and Ca2+ influx?* was monitored using α-pThr172-AMPK. Activation of 
AMPK on its targets was further determined by both α-pSer36-H2B and by the 
expression level of NAMPT. 

Cell viability assay. HeLa (1 X 10°) cells were reverse-transfected with siRNAs and 
viability was monitored using an RTCA iCELLigence System (ACEA Biosciences). 
HeLa cells were cultured in a humidified incubator with 5% CO2 in DMEM medium 
(Invitrogen) supplemented with 10% FBS (Invitrogen) and 1× penicillin/
streptomycin. 

Preparation of cell fractions. Protocol for cell fractionation was followed as 
described previously’’. Briefly, for whole-cell lysate preparation, HeLa cells were 
dissolved in 1× SDS-polyacrylamide gel electrophoresis (SDS-PAGE) loading buffer 
containing 300 mM NaCl. For cytoplasmic fraction preparation, HeLa cells were 
suspended in 0.1 ml of swelling buffer for 6 min on ice, then incubated with 0.1 ml of 
plasma membrane lysis buffer for 5 min on ice. The cells were immediately passed 
through a 21-gauge needle ten times and centrifuged for 10 min at 3220g, 4 °C. The 
supernatant was harvested, while the pellets were used for the preparation of nu- 
clear fractions. For this purpose, the pellet (nuclei) was incubated for 30 min at 4 °C 
with 0.2 ml nuclear extraction buffer. Whenever required, PARP1 (AG14361) and 


PARG inhibitors (ADP-HPD, Millipore) were added to the cell lysis buffer to prevent 
unwanted PARylation and removal of PAR chains by PARG. 
Immunoprecipitation and Ni-NTA pull-down assays. Protocol for immunopre- 
cipitation was followed as described previously’’. Briefly, the supernatants were 
pre-cleared by incubation with protein G beads. The pre-cleared cell lysates were 
incubated at 4 °C for 1 h with either α-PARP1, α-ATM, α-DNA-PKcs, α-TyrRS, 
α-Tip60, or non-immune immunoglobulin-G (IgG), at a concentration of 5 μg ml^-1, 
followed by incubation with 30 μl Protein G beads (pretreated with 10 mg ml^-1 
BSA) at 4 °C for 1 h with rotation. PARP1 (AG14361) and PARG inhibitors (ADP-
HPD, Millipore) were added to the cell lysis buffer to prevent unwanted PARylation 
and removal of PAR chains by PARG. Immunoprecipitates were washed three 
times, subjected to SDS-PAGE and immunoblotted with specific antibodies. When- 
ever mentioned, the ZZ domain allowed immunoprecipitation of ectopically ex- 
pressed ZZ-PARP1, using anti-IgG. Whenever Ni-NTA pull-down was performed, 
proteins with a 6×His tag were overexpressed in Escherichia coli. Cells were lysed 
and the supernatant fractions containing the soluble proteins were mixed with HeLa 
cell lysates. For Ni-NTA pull-downs, normal procedures for immunoprecipitation 
were followed with 15-20 mM imidazole in the washing buffer. 

Animal experiments. All experiments were approved by The Scripps Research In- 
stitute Institutional Animal Care and Use Committee (protocol number 13-0002) 
and conducted by the mouse facility at The Scripps Research Institute. BALB/cByJ 
mice were originally purchased from Jackson Laboratories. Six-week-old male mice 
were kept with a 12 h light-dark cycle with free access to food and water for 3 days 
before conducting the experiment. For all studies, mice were dosed once and samples 
were collected after either 30 min or 24 h. Briefly, activation of PARP1 in mouse 
tissues treated with resveratrol was performed by the intravenous injection of a 
100 μl sample of resveratrol (10 μM) in PBS into the tails of eight mice (0.012 mg kg^-1). 
A 100 μl PBS injection was used as a control in six mice. Tissue samples were collected 
from four resveratrol-treated mice and three control mice after 30 min treatment. 
Tissue samples from the remaining mice were collected after 24 h. In a different 
experiment, blocking of resveratrol-mediated activation of PARP1 by Tyr-SA was 
analysed with five groups of two mice each: a 100 μl sample of resveratrol (10 μM) 
in PBS alone and, separately, with Tyr-SA (5 μM), Gly-SA (5 μM), cycloheximide 
(5 μM), or the AG14361 PARP1 inhibitor (10 μM) was injected intravenously. A 
100 μl PBS injection with Tyr-SA was used in a control group (two mice) as indicated. 
Tissue samples were collected after 30 min treatment. Muscle or heart tissue 
samples were homogenized and analysed by immunoblotting for the status of poly- 
ADP-ribosylation and associated signalling events. 
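The quoted dose of 0.012 mg kg^-1 can be checked from the injected volume and concentration. The following is a minimal sketch rather than the authors' calculation: the molar mass of resveratrol (228.24 g mol^-1) is a known constant, but the body mass of 19 g is an assumed value for a six-week-old mouse that is not stated in the text, and the function name is hypothetical.

```python
# Sketch: convert an injected volume and concentration into a mg/kg dose.
RESVERATROL_MW_G_PER_MOL = 228.24  # molar mass of resveratrol (C14H12O3)
ASSUMED_BODY_MASS_G = 19.0         # assumption: not given in the Methods


def dose_mg_per_kg(volume_ul, conc_um, mw_g_per_mol, body_mass_g):
    """Return the injected dose in mg per kg of body mass."""
    moles = (volume_ul * 1e-6) * (conc_um * 1e-6)  # litres x mol per litre
    mass_mg = moles * mw_g_per_mol * 1e3           # grams converted to mg
    return mass_mg / (body_mass_g * 1e-3)          # divide by body mass in kg


# 100 ul of 10 uM resveratrol, as described for the tail-vein injection.
dose = dose_mg_per_kg(100, 10, RESVERATROL_MW_G_PER_MOL, ASSUMED_BODY_MASS_G)
print(f"{dose:.3f} mg/kg")
```

Under this assumed body mass the result rounds to the reported value; a heavier animal would give a proportionally lower dose.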

Recombinant protein purification. Human TyrRS, PARP1 and their variants 
were purified as previously described’*’’. Briefly, DNA encoding either full-length 
TyrRS (amino acids 1-528), mini-TyrRS (amino acids 1-341), dN-TyrRS (amino 
acids 237-528), or CT-TyrRS (amino acids 324-528) was cloned into Nde-I/Hind- 
III sites of pET-20b vector (Novagen). The expressed proteins included a 6-His tag 
from the vector sequence. The full-length PARP1 and its variants (amino (N)- and 
C-terminal domains) were cloned into pET 20b vector. All proteins having a 
C-terminal His-tag were expressed in E. coli strain BL21 (DE3) by induction for 
4 h with 1 mM isopropyl β-D-thiogalactopyranoside. Proteins were purified from 
the supernatants of lysed cells using Ni-NTA agarose (Qiagen) column chromato- 
graphy according to the manufacturer’s instructions. We also included an additional 
high-salt two-column wash (1-1.5 M NaCl), to remove the endogenous DNA/RNA 
associated with the purified protein. The NaCl concentration was increased to 1.5 M 
in increments and returned to final elution buffer having 250 mM NaCl in decre- 
ments. All the proteins were subjected to a gel-filtration (S-200) chromatography 
and the protein peak corresponding to homogenous protein was collected. As en- 
dotoxin associated with purified protein interfered with the inhibitory/activating 
effect of RSV on TyrRS/PARP1, the complete protein purification included an additional 
endotoxin removal step. The purified protein was further passed through a 
column containing Detoxi-Gel™ (Pierce-Thermo Scientific) according to the manufacturer’s 
instructions. The final endotoxin level (0.5 EU mg^-1 ml^-1) was measured 
using an Endosafe®-PTS kit (Charles River Laboratories). The quality of each 
protein purification was validated by SDS-PAGE analysis. 

Protein crystallization and data collection. All the steps were followed as men- 
tioned in the previous report for mini-TyrRS crystallization’®. Briefly, mini-TyrRS 
was mixed with either 1 mM RSV or 2 mM tyrosine and incubated at 4°C for 16 h. 
Before setting up the crystallization trials, the protein samples were subjected to 
high-speed centrifugation (15,871 relative centrifugal force for 15 min) to remove 
all the precipitants and the clear soluble fraction transferred to a new tube. The 
crystallization trials were done using 2.1 M (NH4)2SO4, 0.1 M NaH2PO4/K2HPO4 
(pH 6-8), and 2% acetone at room temperature (22-25 °C). Although mini-TyrRS 
crystals with L-tyrosine were grown in 3 days (data collected at Stanford Synchrotron 
Radiation Lightsource), crystals with RSV were grown only after 3-4 months 
at around pH 7. X-ray data were collected at 2.1 Å using the in-house X-ray diffraction 
facility (The Scripps Research Institute). Data were integrated and scaled using 
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HKL2000. The electron density was refined by molecular replacement (CNS and 
CCP4/REFMAC suite and Coot) using the known structure of mini-TyrRS (Ex- 
tended Data Table 1) and structures were deposited in PDB (accession numbers 
4Q93 and 4QBT). 
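For anyone reproducing the clarification spin, the quoted relative centrifugal force (15,871) can be converted to a rotor speed with the standard relation RCF = 1.118 × 10^-5 × r × rpm^2, where r is the rotor radius in cm. The sketch below is illustrative only: the 8.4 cm radius is an assumed value for a benchtop microcentrifuge rotor, not a figure from the Methods, and the function names are hypothetical.

```python
import math

RCF_CONSTANT = 1.118e-5      # standard factor for radius in cm and speed in rpm
ASSUMED_RADIUS_CM = 8.4      # assumption: rotor radius is not given in the text


def rcf_from_rpm(rpm, radius_cm):
    """Relative centrifugal force (x g) for a given speed and rotor radius."""
    return RCF_CONSTANT * radius_cm * rpm ** 2


def rpm_from_rcf(rcf, radius_cm):
    """Rotor speed (rpm) needed to reach a given relative centrifugal force."""
    return math.sqrt(rcf / (RCF_CONSTANT * radius_cm))


rpm = rpm_from_rcf(15871, ASSUMED_RADIUS_CM)
print(f"{rpm:.0f} rpm")
```

With a different rotor the required speed changes with the inverse square root of the radius, so the radius should be taken from the rotor actually used.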

ATP-pyrophosphate exchange assay. Tyrosyl adenylate synthesis was measured 
by using the tyrosine-dependent ATP-pyrophosphate (PPi) exchange assay. A 
mixture containing 100 mM HEPES (pH 7.5), 20 mM KCl, 2 mM ATP, 1 mM 
NaPPi, 2 mM DTT, 250-500 μM L-tyrosine, 10 mM MgCl2 and about 
0.01 mCi ml^-1 Na[32P]PPi was added to 50-100 nM purified TyrRS (endotoxin 
free, 0.5 EU mg^-1 ml^-1), pre-incubated with 0-1 mM RSV at 4 °C for 30 min. The 
ATP-PPi exchange reaction was incubated at room temperature (22-25 °C), and 
aliquots were removed at specified time intervals and quenched in a mixture containing 
40 mM NaPPi, 1.4% HClO4, 0.4% HCl, and 8% (w/v) of activated charcoal. 
After thorough mixing, the charcoal was filtered and washed with a solution of 
7% HClO4 and 200 mM NaPPi using Spin-X Centrifuge Filters (Corning) containing 
0.45-μm pore-size cellulose acetate filters. After drying, the charcoal was 
punched into scintillation vials and the radioactivity of the ATP bound to the char- 
coal mixture was measured by scintillation counting. 

In vitro PARylation assay. Protocol was as previously described’’. Briefly, 5 μl 
recombinant PARP1 (2 μg) was mixed with 5 μl recombinant TyrRS (0-50 μM) 
and, depending on the experiment, either RSV, Tyr-SA, DNA, or AG14361 was also 
added (5 μl) and the volume was adjusted to 25 μl. This mixture was incubated at 4 °C 
for 15 min, then incubated with 25 μl PAR assay cocktail (50 mM HEPES, pH 7.5, 
100 mM KCl, 10 mM MgCl2, 0.2 mM EGTA, 0.1 mM EDTA, 40 nM NAD+, and 
1 μCi [32P]NAD+ (Perkin-Elmer)) at 30 °C for 30 min. The RSV concentration used 
in Fig. 2a (middle) was 0, 5, 10, 20, 30, 40, 60, and 80 nM, and in Fig. 2a (bottom) was 
40 nM. The reaction was stopped by addition of 50 μl 2× SDS sample buffer (5% 
SDS) and heated at 95 °C for 1 min. The [32P]-PARylated PARP1 was separated 
from free [32P]NAD+ on SDS-PAGE, the gel was vacuum-dried, and the poly-ADP-ribosylation 
activity of PARP1 was analysed using a phosphorimager (Typhoon FLA 7000, GE 
Healthcare). 
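Because 25 μl of protein mixture is combined with 25 μl of assay cocktail, every cocktail component is diluted twofold in the reaction. The sketch below works through that arithmetic. It assumes the concentrations listed for the cocktail are those of the cocktail itself before mixing, which the text does not state explicitly; the dictionary keys and function name are hypothetical.

```python
def final_conc(stock_conc, added_vol_ul, total_vol_ul):
    """Concentration of a component after dilution into the total volume."""
    return stock_conc * added_vol_ul / total_vol_ul


# Cocktail composition as listed in the text (assumed to be pre-mixing values).
cocktail = {
    "HEPES_mM": 50.0,
    "KCl_mM": 100.0,
    "MgCl2_mM": 10.0,
    "EGTA_mM": 0.2,
    "EDTA_mM": 0.1,
}

COCKTAIL_UL = 25.0   # volume of PAR assay cocktail added
MIXTURE_UL = 25.0    # volume of the pre-incubated protein mixture

in_reaction = {
    name: final_conc(conc, COCKTAIL_UL, COCKTAIL_UL + MIXTURE_UL)
    for name, conc in cocktail.items()
}

for name, conc in in_reaction.items():
    print(name, conc)
```

The same function applies to the 5 μl additions to the 25 μl pre-incubation mixture, where the dilution factor is fivefold.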
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Extended Data Figure 1 | Resveratrol inhibits TyrRS activation. a, The 
ATP-PPi exchange assay as described in Methods demonstrated the inhibitory 
effect of resveratrol on TyrRS and b resveratrol shifts the Michaelis constant 
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(K,») for tyrosine. c, Resveratrol binds TyrRS better than tyrosine. The apparent 
Ki for resveratrol was deduced by varying the concentration of RSV and 
plotting the slope of (1/v versus 1/[Tyr]) versus [RSV] as indicated. 
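The secondary plot described in this legend can be sketched numerically. The example below assumes simple competitive inhibition, in which the double-reciprocal slope is (Km/Vmax)(1 + [I]/Ki), so that a straight-line fit of slope against [RSV] gives Ki as intercept divided by gradient. The kinetic constants and concentrations are synthetic values chosen only to exercise the code (the Ki of 22 μM mirrors the value quoted in the main text); none of the numbers are measured data, and the function names are hypothetical.

```python
def linear_fit(xs, ys):
    """Ordinary least-squares line through (xs, ys); returns (slope, intercept)."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


def rate_competitive(s, i, vmax, km, ki):
    """Michaelis-Menten rate with a competitive inhibitor at concentration i."""
    return vmax * s / (km * (1.0 + i / ki) + s)


def estimate_ki(substrate_concs, inhibitor_concs, rates):
    """rates[j][k] is the rate at inhibitor_concs[j] and substrate_concs[k]."""
    inv_s = [1.0 / s for s in substrate_concs]
    lb_slopes = []
    for row in rates:
        # Primary plot: 1/v versus 1/[S] at one inhibitor concentration.
        slope, _ = linear_fit(inv_s, [1.0 / v for v in row])
        lb_slopes.append(slope)
    # Secondary plot: double-reciprocal slope versus [I].
    gradient, intercept = linear_fit(inhibitor_concs, lb_slopes)
    return intercept / gradient


# Synthetic, noise-free example (all values are illustrative assumptions).
tyr = [25.0, 50.0, 100.0, 200.0, 400.0]   # substrate, uM
rsv = [0.0, 10.0, 20.0, 40.0]             # inhibitor, uM
data = [[rate_competitive(s, i, 1.0, 50.0, 22.0) for s in tyr] for i in rsv]
ki_est = estimate_ki(tyr, rsv, data)
print(f"estimated Ki = {ki_est:.2f} uM")
```

With real measurements the double-reciprocal fits are sensitive to error at low substrate concentrations, so a direct nonlinear fit of the rate equation may be preferable; the replot is shown here because it is the procedure the legend describes.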
Extended Data Figure 2 | Resveratrol induces a distinct conformational 
change upon binding to active site of TyrRS. a, Comparison of the overall 
conformational change induced by resveratrol at the active site of TyrRS by 
structure-based superposition (yellow, tyrosine-bound structure; magenta, 
resveratrol-bound structure). Note the conformational change near the helix 
region (P331-P342) that connects the linker region with the C-domain. 
b, Illustration of the extensive interactions of resveratrol with the active site. 
c, The trans-resveratrol (dark blue) docks (manual) into TyrRS active site 
without significant structural disturbances. d, Generation of a new pocket 
through a RSV-induced conformational change in TyrRS accommodates the 
dihydroxy phenolic ring of RSV (otherwise exposed to the destabilizing 
aqueous environment in the trans form) and hence facilitates the trans (dark 
blue) to cis (light blue) conversion of RSV. 
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Extended Data Figure 3 | Resveratrol facilitates the TyrRS-PARP1 
interaction in an active-site-dependent manner. a, Both heat shock (42 °C for 
30 min) and tunicamycin treatment (10 μg ml^-1, endoplasmic reticulum 
stress) facilitated the nuclear translocation of TyrRS and activation of PARP1. 
b, Resveratrol or serum starvation facilitates TyrRS interaction with PARP1, 
and Tyr-SA prevents this interaction. ZZ-PARP1 was immunoprecipitated with 
IgG from HeLa cells treated with RSV or serum starvation alone or in 
combination with Tyr-SA. c, Resveratrol- or serum-starvation-mediated 
PARP1 activation is blocked only by Tyr-SA and not by Gly-SA. d, TyrRS 
interacts directly with PARP1. HeLa cell lysate after RSV treatment (5 μM, 
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30 min) was divided into three parts and treated with PARG and catalytically 
inactive PARG-MT. PARP1 was immunoprecipitated and analysed for TyrRS 
interaction. e, Model illustrating the mechanism of RSV-mediated TyrRS 
interaction with PARP1 and subsequent release after auto-PARylation. f, Ni- 
NTA pull-down of N- and C-terminal fragments of PARP1 overexpressed in 
E. coli demonstrates that TyrRS interacts with the C-terminal region of PARP1. 
g, Only the full-length TyrRS (1-528), but none of the various fragments of 
TyrRS (mini-TyrRS (1-364), ΔN-TyrRS (228-528), or the C-domain 
(328-528)), interacts with PARP1. h, Coomassie blue staining of a gel showing 
the total protein input for the experiment of Extended Data Fig. 3g. 
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Extended Data Figure 4 | Tyrosyl-AMP analogue (Tyr-SA) does not affect 
DNA-dependent auto-PARylation of PARP1. a, Silver-stained SDS-PAGE 
gel showing the purity and input of PARP1 and TyrRS in the in vitro 
PARylation study of Fig. 2. b, Quantitation (ImageJ software) of the band 
intensity of PARylated PARP1 in Fig. 2a, top. c, Tyrosyl-AMP analogue 
(Tyr-SA) does not affect DNA-dependent auto-PARylation of PARP1. 
d, Overexpression of nuclear-translocation-weakened mutant of TyrRS° is 
less effective in activating PARP1. e, Y341A-TyrRS is more sensitive to RSV 
than is TyrRS in facilitating PARP1 activation. 
Extended Data Figure 5 | Resveratrol enhances the acetylation of Tip60 and 
modulates NAD+ concentration in a dose- and time-dependent manner. 
a, Treatment of HeLa cells (1 h) with increasing concentration of resveratrol 
enhances the acetylation level of Tip60. Activation of Tip60 was monitored by 
histone acetylation status. b, Total NAD+ contents of serum-starved cells or 
RSV-treated samples were compared with untreated samples at 15 min using a 
commercially available BioVision NAD+/NADH quantitation colorimetric 
kit. c, Total nicotinamide or ADP-ribose produced was deduced from the 
difference in the amount of NAD+ in each sample with respect to the untreated 
sample (consumption of one mole of NAD+ would give rise to one mole of 
nicotinamide and one mole of ADP-ribose). d, Total NAD+ content of the 
serum-starved cells or RSV-treated samples were compared with untreated 
samples at 1 h. (Although the experiments were done in biological triplicates 
(all samples showing similar results), the error bars in the figure represent the 
deviations from the mean of the technical triplicates from one representative 
biological sample.) e, f, Time course study of poly-ADP-ribosylation status 
and associated signalling events after (e) serum starvation (extended time 
course data of the same image shown in Fig. 3c) and (f) treatment with 
5 μM RSV. Using the respective antibodies, activation of p53 was monitored 
by the induction of p21 and SIRT6. Activation of NRF2 was monitored by 
HO-1 induction. 
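The deduction used for panel c follows directly from the 1:1:1 stoichiometry stated in the legend. The sketch below illustrates it with made-up NAD+ readings; the numbers, units and function name are hypothetical, and clamping negative differences to zero is a design choice made here (a rise in NAD+ gives no information about by-product formation), not something described in the paper.

```python
def nad_byproducts(nad_untreated, nad_treated):
    """Infer nicotinamide and ADP-ribose formed from the drop in NAD+.

    One mole of NAD+ consumed yields one mole of nicotinamide and one mole
    of ADP-ribose. Inputs and outputs share the same (arbitrary) molar unit.
    """
    consumed = nad_untreated - nad_treated
    if consumed < 0:
        # NAD+ increased, so no net consumption can be inferred this way.
        consumed = 0.0
    return {"nicotinamide": consumed, "adp_ribose": consumed}


# Hypothetical readings, for illustration only (not data from the paper).
example = nad_byproducts(nad_untreated=100.0, nad_treated=80.0)
print(example)
```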
Extended Data Figure 6 | siRNA (siRNA^TyrRS or siRNA^PARP1), with and 
without low RSV (5 μM), does not affect cell viability. HeLa cells (1 × 10°) 
were reverse-transfected with siRNA targeted to TyrRS or PARP1. An 
siRNA^Ctrl (a scrambled sequence of siRNA^PARP1) was used as a control. 
Viability was monitored using the RTCA iCELLigence System (ACEA 
Biosciences). Samples were treated with RSV (5 μM) at 60 h and 
monitoring was continued for another 2 h for siRNA^TyrRS (a total of 62 h of 
monitoring) and for another 16 h for siRNA^Ctrl and siRNA^PARP1 (total 
76 h monitoring). 
Extended Data Figure 7 | siRNA^SIRT1 did not affect downstream signalling 
events at low RSV (5 μM). HeLa cells were treated with siRNA^SIRT1 for 60 h 
to knock down SIRT1. HeLa cells were then treated with RSV (5 μM) for another 
4 h and samples were collected at intervals as indicated. Samples were analysed for 
downstream signalling markers using appropriate antibodies. 
Extended Data Figure 8 | siRNA- (siRNA^TyrRS or siRNA^PARP1) treated 
cells did not upregulate the levels of NAD+ in response to RSV (5 μM) 
after 1 h. HeLa cells (1 × 10°) were reverse transfected, separately, with siRNA 
targeted to PARP1 or TyrRS. A scrambled sequence of target siRNA was used 
as a control. The total NAD+ content of RSV (5 μM)-treated samples was 
compared with untreated samples at 1 h, using a commercially available 
BioVision NAD+/NADH quantitation colorimetric kit. (Although the 
experiments were done in biological triplicates (all samples showing similar 
results), the error bars in the figure represent the deviations from the mean of 
the technical triplicates from one representative biological sample.) The 
comparator (shown as a dashed bar) is taken from Extended Data Fig. 5d. 
Extended Data Figure 9 | Resveratrol treatment activates PARP1 and
associated signalling events in the mouse tissues. a, Activation of PARP1 in
mouse muscle tissue treated with resveratrol monitored by increased
PARylation and b by increased acetylation status. Activations of Tip60 and
AMPK were monitored by using α-AcK16-H4 and α-pSer36-H2B,
respectively. c, Activation of PARP1 in mouse heart tissue treated with
resveratrol monitored by increased PARylation and d by increased acetylation
status. e, Resveratrol treatment causes only a transient activation on PARP1.
Immunoblotting of mouse muscle tissue samples after 24 h of RSV treatment
showed no significant difference in the level of PARP1°“* compared with
control. f, RSV treatment enhances TyrRS interaction with and activation of
PARP1 in the muscle tissue. g and h, Resveratrol-mediated activation of PARP1
(g, monitored by PARylation status; h, monitored by acetylation status) is
blocked by Tyr-SA in mouse heart tissues.
Extended Data Table 1 | Data collection and refinement statistics

                          Resveratrol-TyrRS          L-Tyrosine-TyrRS
Data collection
  Space group             P21212                     P21212
  Cell dimensions
    a, b, c (Å)           73.97, 162.55, 35.0        73.97, 162.55, 35.0
    α, β, γ (°)           90, 90, 90                 90, 90, 90
  Resolution (Å)          37 - 2.1 (2.15-2.09)*      34.63 - 2.1 (2.15-2.09)*
  Rsym or Rmerge          271                        44.7
  I/σI                    41                         4.6
  Completeness (%)        98.6                       98
  Redundancy              32                         5.2
Refinement
  Resolution (Å)          2.1                        2.1
  No. reflections         25438                      24513
  Rwork / Rfree           0.19/0.24                  0.22/0.27
  No. atoms               2955                       2858
    Protein               2653                       2636
    Ligand/ion            16                         12
    Water                 284                        238
  B-factors
    Protein               37.664                     38.199
    Ligand/ion            43.2                       42.3
    Water                 47.3                       48.4
  R.m.s. deviations
    Bond lengths (Å)      0.018                      0.017
    Bond angles (°)       2.017                      2.008

*Highest resolution shell is shown in parenthesis.

Numerical parameters for the RSV-TyrRS and L-Tyr-TyrRS co-crystals are listed and compared side-by-side in separate columns.


©2015 Macmillan Publishers Limited. All rights reserved 




doi:10.1038/nature14020 


NAD captureSeq indicates NAD as a bacterial cap for a subset of regulatory RNAs


Hana Cahová¹*, Marie-Luise Winz¹*, Katharina Höfer¹*, Gabriele Nübel¹ & Andres Jäschke¹


A distinctive feature of prokaryotic gene expression is the absence of 
5’-capped RNA. In eukaryotes, 5’,5’-triphosphate-linked 7-methyl- 
guanosine protects messenger RNA from degradation and modulates 
maturation, localization and translation’. Recently, the cofactor nico- 
tinamide adenine dinucleotide (NAD) was reported as a covalent 
modification of bacterial RNA’. Given the central role of NAD in 
redox biochemistry, posttranslational protein modification and 
signalling’, its attachment to RNA indicates that there are unknown 
functions of RNA in these processes and undiscovered pathways in 
RNA metabolism and regulation. The unknown identity of NAD- 
modified RNAs has so far precluded functional analyses. Here we 
identify NAD-linked RNAs from bacteria by chemo-enzymatic cap- 
ture and next-generation sequencing (NAD captureSeq). Among those 
identified, specific regulatory small RNAs (sRNAs) and sRNA-like 
5’-terminal fragments of certain mRNAs are particularly abundant. 
Analogous to a eukaryotic cap, 5’-NAD modification is shown in 
vitro to stabilize RNA against 5’-processing by the RNA-pyropho- 
sphohydrolase RppH’ and against endonucleolytic cleavage by ribo- 
nuclease (RNase) E°. The nudix phosphohydrolase NudC’ decaps 
NAD-RNA and thereby triggers RNase-E-mediated RNA decay, while 
being inactive against triphosphate-RNA. In vivo, ~13% of the abun- 
dant sRNA RNAI is NAD-capped in the presence, and ~26% in the 
absence, of functional NudC. To our knowledge, this is the first 
description of a cap-like structure and a decapping machinery in 
bacteria. 

To isolate NAD-RNA from biological samples, a new capturing tech- 
nique was required, which had to be specific for the NAD moiety without 
damaging the RNA molecules. NAD glycohydrolases appeared suit- 
able but NADase from porcine brain contained RNases, and recom- 
binant human CD38 was not capable of preparative transglycosylation 
on NAD. Adenosine diphosphate-ribosylcyclase (ADPRC) from Aplysia 
californica®, previously described to replace nicotinamide with other 
N-heterocyclic nucleophiles’, was surprisingly found to be capable of 
catalysing the transglycosylation reaction of NAD with alkynyl alcohols. 
ADPRC requires a labile N-glycosidic bond, as in the positively charged 
pyridine, but not in canonical purine and pyrimidine nucleosides’. The 
‘clickable’ transglycosylation product was biotinylated by a copper- 
catalysed azide-alkyne cycloaddition'’”’, for which the overall yield 
was highest using 4-pentyn-1-ol, both with free NAD (Extended Data 
Fig. 1) and with in vitro transcribed 5'-NAD-RNA”. No reaction pro- 
duct was detected with NADH and canonical RNA. After capturing the 
biotin-linked RNAs on streptavidin beads, they were selectively amp- 
lified by a protocol combining 3'-adaptor ligation, reverse transcrip- 
tion, tailing of complementary DNA, ligation of a second adaptor and 
polymerase chain reaction (PCR). We applied this workflow (NAD 
captureSeq; Fig. 1) to total RNA from either a modified Escherichia coli 
JM109 laboratory strain or a more wild-type-like E. coli K-12 strain to 
construct three different next-generation sequencing (NGS) libraries 
(Extended Data Fig. 2a-c). 

In all three data sets (Supplementary Data 1-3), comparison of reads 
from samples (B), which were fully treated, versus controls (A), which
lacked ADPRC, allowed a clear identification of enriched RNAs among
the most abundant sequences (Fig. 2 and Extended Data Figs 2d-i, 3). 
Many enriched candidates were sRNAs reported to act by different 
pathways'*”*, for example, GadY, GcvB, ChiX, McaS, RNAI and CopA 
(Fig. 2 and Extended Data Table 1). The latter two were observed only 
in JM109, as they are replication control RNAs encoded on plasmids 
absent in our K-12 strain’®’’. Other sRNAs were found not to be enriched. 
Another set of enriched sequences represented 5’-fragments of differ- 
ent mRNAs, for example, of genes gatY, pgk, hdeD, and of the leader 
peptide genes ilvL and hisL (Extended Data Fig. 3). Besides the leader 
peptides with regulatory functions, several of the encoded enzymes are 
involved in cellular metabolism (for example, PGK"*). Other enzymes 
and several enriched sRNAs act in stress responses (for example, res- 
ponse to acid, enzyme HdeD”, and sRNAs GadY™*, GcvB” and ChiX”', 
or response to ultraviolet stress, enzyme UspE”). In contrast, no ribosomal 
RNA or transfer RNA was enriched. Apart from annotated transcrip- 
tion starts at adenosine for most enriched genes, no common sequence 
or structural features were identified for the enriched RNAs in promoters,
transcripts, or reported transcription and σ factors.

To verify that the enrichment observed in NAD captureSeq repre- 
sents a difference in cDNA amounts rather than PCR bias during NGS 
library preparation, we performed real-time PCR with cDNA collected 
after the second adaptor ligation, before PCR. As reference RNAs, we 
chose species that were detected but not enriched in NGS (Extended 
Data Fig. 4a). These data supported our results: after normalization to 
tRNA*“ and 5S rRNA, RNAI and CopA RNA were enriched >500-fold, 
while four other sRNAs yielded 24- to 163-fold enrichments. To confirm
that, despite the lack of a fragmentation step in NAD captureSeq,
specific 5'-terminal mRNA fragments, rather than full-length mRNAs,
were enriched, real-time PCRs were performed with primers targeting
different loci within the transcript. Up to ~400-fold enrichment
of mRNA 5’-termini was corroborated, whereas primers targeting the 
middle of those mRNAs yielded no or decreased enrichment (Extended 
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Figure 1 | NAD captureSeq protocol applied to total RNA. CuAAC, copper- 
catalysed azide-alkyne cycloaddition; pentynol, 4-pentyn-1-ol. 
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Figure 2 | Analysis of enriched RNAs by NGS. a, b, Abundance of RNAs in 
fully treated sample (B1) versus minus ADPRC control (A1) found by NGS 
analysis of JM109 samples (120-160 bp) (reads per million (RPM), average of 
forward and reverse read counts). Triangles, enriched; circles, non-enriched. 


Data Fig. 4b, d). Additionally, northern blot analysis of non-treated total 
RNA (Extended Data Fig. 4c) confirmed the presence of short 5'- 
terminal fragments, indicating that they are not generated during NAD 
captureSeq, but rather represent aborted or processed transcripts or 
hitherto unknown sRNAs. These 5’-fragments exist in an NAD-modified 
state and may have additional (for example, regulatory) roles. 
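
The real-time PCR validation described above compares cDNA from the fully treated sample with cDNA from the minus ADPRC control after normalization to non-enriched reference RNAs. The text does not state the formula that was used. The following is a minimal sketch of one common approach (the 2^-ΔΔCt method), assuming equal and ideal amplification efficiency for target and reference. The function name and all Ct values are hypothetical and are not data from this study.

```python
def fold_enrichment(ct_target_sample, ct_target_control,
                    ct_ref_sample, ct_ref_control, efficiency=2.0):
    """Fold enrichment of a target RNA in the fully treated sample (B)
    relative to the minus ADPRC control (A), normalized to a reference RNA.

    Assumption: target and reference amplify with the same efficiency
    (2.0 means perfect doubling per cycle).
    """
    # Normalize the target to the reference within each condition.
    delta_sample = ct_target_sample - ct_ref_sample
    delta_control = ct_target_control - ct_ref_control
    # Compare the two normalized values.
    delta_delta_ct = delta_sample - delta_control
    return efficiency ** (-delta_delta_ct)


# Hypothetical Ct values, for illustration only.
example = fold_enrichment(ct_target_sample=18.0, ct_target_control=27.0,
                          ct_ref_sample=20.0, ct_ref_control=20.0)
print(example)
```

With these illustrative inputs the target appears nine cycles earlier in the sample than in the control, while the reference is unchanged, so the sketch reports a 2^9-fold enrichment.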

To prove that RNAs identified by NAD captureSeq do indeed bear 
the NAD modification and have not been isolated for other reasons, 
highly abundant RNAI’”” and 5S rRNA were pulled-down from total 
RNA, nuclease P1 digested and analysed by high performance liquid 
chromatography mass spectrometry (HPLC-MS)*™. As expected, the 
extracted ion chromatogram for the species with the mass to charge ratio 
(m/z) = 540.05 ([M−H]⁻) corresponding to the main NAD fragment
formed during MS ionization’, revealed a characteristic peak for in vitro 
transcribed model NAD-RNA. This peak was also detected for cellular 
RNAI, but not for the 5S rRNA control (Fig. 3a, b), for which similar 
amounts of the canonical nucleotides, but no NAD or NAD fragments, 
were measured. The peak disappeared upon treatment with ADPRC 
and 4-pentyn-1-ol before digestion, while a new peak (m/z = 624.10) 
appeared, representing the reaction product. For the less abundant 
sRNA GcvB, the HPLC-MS protocol was modified to detect the [M]⁺
ion of NAD, m/z = 664.116, which was, again, not present in the 5S
rRNA control (Extended Data Fig. 5). 
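
The two m/z values quoted above can be checked from elemental compositions. The sketch below computes monoisotopic m/z values for the intact NAD cation and for an assumed fragment. The fragment composition (NAD after loss of nicotinamide, observed as a singly charged anion) is our assumption, chosen because it reproduces the reported 540.05. The text only calls it "the main NAD fragment". The helper name `ion_mz` is hypothetical.

```python
# Monoisotopic masses (u) of the most abundant isotopes.
MONOISOTOPIC = {"C": 12.0, "H": 1.00782503223, "N": 14.00307400443,
                "O": 15.99491461957, "P": 30.97376199842}
ELECTRON_MASS = 0.000548579909


def ion_mz(composition, charge):
    """m/z of an ion whose elemental composition already includes any
    protons gained or lost; only the electron mass is corrected for."""
    atoms = sum(MONOISOTOPIC[el] * n for el, n in composition.items())
    return (atoms - charge * ELECTRON_MASS) / abs(charge)


# Intact NAD cation, C21H28N7O14P2 with charge +1.
nad_cation = ion_mz({"C": 21, "H": 28, "N": 7, "O": 14, "P": 2}, charge=+1)

# Assumed fragment: NAD minus nicotinamide (C6H6N2O), detected as an
# anion, C15H20N5O13P2 with charge -1.
nad_fragment = ion_mz({"C": 15, "H": 20, "N": 5, "O": 13, "P": 2}, charge=-1)

print(round(nad_cation, 3), round(nad_fragment, 2))
```

Under these assumptions the computed values agree with the m/z = 664.116 and m/z = 540.05 given in the text.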

The 5’-modification status of RNA (triphosphate versus monopho- 
sphate) was recently reported to determine the stability and processing 
of RNAs themselves® and of sRNA targets”. We speculated that the 
NAD modification may act in a similar way to the eukaryotic cap and 
stabilize regulatory RNAs. Indeed, in vitro transcribed NAD-RNAI is 
strongly stabilized (18.5-fold compared with 5’-monophosphate-RNAI 
and 3.5-fold compared with 5’-triphosphate-RNAI; Fig. 3c-e) against 
RNase E, the enzyme that initiates many decay pathways in E. coli. 
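
A fold stabilization of this kind can be read as a ratio of decay rates. The text does not describe the fitting procedure, so the following is only a sketch under the assumption of first-order decay of the full-length band: it fits ln(fraction remaining) against time by least squares and compares two rate constants. The time points and rate constants are synthetic and purely illustrative.

```python
import math


def first_order_rate(times, fractions):
    """Least-squares slope of ln(fraction remaining) versus time,
    returned as a positive first-order rate constant."""
    logs = [math.log(f) for f in fractions]
    n = len(times)
    mean_t = sum(times) / n
    mean_y = sum(logs) / n
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times, logs))
    sxx = sum((t - mean_t) ** 2 for t in times)
    return -sxy / sxx


def fold_stabilization(k_reference, k_test):
    """How many times more slowly the test RNA decays than the reference."""
    return k_reference / k_test


# Synthetic time courses (minutes, fraction of full-length RNA remaining).
times = [0, 5, 10, 20, 40]
reference = [math.exp(-0.30 * t) for t in times]  # hypothetical fast decay
test_rna = [math.exp(-0.03 * t) for t in times]   # hypothetical slow decay

fold = fold_stabilization(first_order_rate(times, reference),
                          first_order_rate(times, test_rna))
print(round(fold, 1))
```

Real gel quantifications are noisy and may not follow a single exponential, in which case comparing half-lives or initial rates could be more appropriate.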

RNA pyrophosphohydrolase (RppH) has been described to convert 
5'-triphosphate-RNA to 5'-monophosphate-RNA, and thereby trigger 
RNase-E-mediated decay’. We observe that NAD modification strongly 
(60-fold) decelerates RNAI processing by RppH, while a less pronounced 
effect is observed for NADH (sixfold; Fig. 3f, g). RppH is one of 13 nudix 
phosphohydrolases**”’ described in E. coli. Another member of this 
class is NudC, which hydrolyses NAD(H) into NMN(H) and AMP’. We 


Initial capitals indicate sRNAs. c, NGS analysis of RNAI sequences found in 
controls (A1, red) and samples (B1, green) in forward (fwd) and reverse (rev) 
reads. d, NGS analysis of 5S rRNA sequences. 


find that this enzyme is highly active in decapping 5'-NAD(H)-RNAI 
but is virtually inactive against 5'-triphosphate-RNAI (Fig. 3h, i). A 
NudC mutant in which a critical residue in the nudix motif is exchanged
(NudC(E178Q)) was completely inactive against transcribed 5’-NAD- 
RNAI (Extended Data Fig. 6a, b). 5’-NAD-RNAI treated with NudC 
for 20 min is subsequently processed by RNase E rapidly and at a rate 
comparable with 5’-triphosphate-RNAI treated with RppH, while the 
other enzyme-substrate combinations result in much slower degrada- 
tion (Extended Data Fig. 6c, d, compare with Fig. 3d, e). 

To examine the biological significance of the NAD cap and to sub- 
stantiate the enzymatic decapping activity, we subjected cellular total 
RNA to NudC decapping followed by splinted ligation in vitro (Extended 
Data Fig. 7a). We found that RNAI was decapped by wild-type NudC, 
but not by the E178Q mutant, in a time-dependent manner (Extended 
Data Fig. 7b, c). These findings provide further support for the pres- 
ence of NAD-capped RNA, and for the 5’-terminus as the attachment 
site’ in vivo. Using this knowledge, we determined the percentage of 
NAD modification for RNAI by NudC decapping of dephosphorylated 
total RNA and specific ligation of NudC-decapped RNAI sequences 
(Fig. 4a, b and Extended Data Fig. 7d-h). This analysis revealed that 
~13% of full-length RNAI contained the NAD cap, whereas 6S RNA, 
which was not enriched in NAD captureSeq, tested negative. Upon chro- 
mosomal nudC deletion, the NAD modification level of full-length 
RNAI was doubled to ~26%. Complementation of this defect in trans 
by a plasmid-borne copy of wild-type nudC but not by an inactive mutant
(nudC E178Q) reduced the NAD modification back to wild-type level. 
Despite these differences in NAD modification state between wild type 
and the ΔnudC knockout strain, no effect on the half-life of RNAI or
GcvB RNA was measured in vivo when transcription was blocked by 
addition of the antibiotic rifampicin (Extended Data Fig. 6e-h). These 
data suggest that further, currently unknown, players are involved in 
5'-NAD-RNA processing, or that only a subset of NAD-RNAs is pro- 
cessed by NudC in vivo. Our observations are strikingly similar to a recent 
report on 5’-triphosphate processing of sRNAs, in which knockout of 
RppH, the enzyme generally accepted to perform 5’-triphosphate 
removal, did not result in major changes in 5’-modification status”. 
These combined findings highlight the complexity of RNA processing, 
which is still poorly understood even in the model organism E. coli. 
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Figure 3 | Characterization of NAD-RNAI. a, HPLC-MS extracted ion 
chromatogram (m/z = 540.05) of nuclease-P1-digested model NAD-RNA, 

5S rRNA and RNAI. b, High-resolution MS of HPLC fractions eluting at

5-6 min in a. c, Schematic representation of RNase E degradation and 
RppH/NudC cleavage of in vitro transcribed RNAI with different radiolabelled 
(red asterisks) 5'-termini (P, phosphate; NPPA = NAD). d-i, 5'-Dependent 
RNase E degradation (d, e), RppH processing (f, g) and NudC decapping (h, i)
of RNAI. In d, radioactive full-length bands disappear due to cleavage. 

In f, h, radiolabels are lost due to 5'-end processing and 5’-NAD(H) samples 
contained alkaline phosphatase to remove ³²P-phosphate accessible after
RppH/NudC processing. Data points in e, g and i represent mean ± standard
deviation (s.d.), n = 2.


To our knowledge, this work represents the first demonstration of a
cap-like structure in prokaryotes, which modulates the susceptibility of 
specific RNAs towards RNase E and phosphohydrolases. The decap- 
ping activity of NudC could provide the bacterium with a mechanism 
to selectively initiate degradation for a subset of cellular RNAs orthogonal 
to RppH processing of 5’-triphosphate-RNA (Fig. 4c). In eukaryotes, 
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Figure 4 | Quantification of NAD modification of RNAI- and NAD- 
decapping activity of NudC in E. coli. a, 5'-Modification state of full-length 
RNAI (FL-RNAI) in vivo in the presence or absence of functional NudC 

(see Extended Data Fig. 7 for details). For each strain: n = 3 (independent 
cultures); mean ± s.d. b, Exemplary northern blots of RNAI and 6S RNA for
the quantification of NAD modification. 6S RNA (not enriched in NAD
captureSeq (Extended Data Table 1)) tested negative. No ligation product was
obtained upon treatment of dephosphorylated total RNA with dysfunctional
NudC(E178Q). tRNA** served as loading control. c, Alternative RNase-E-dependent
RNA degradation pathways. PPi, pyrophosphate; NMN,
nicotinamide mononucleotide. 


nudix phosphohydrolases are known to act as decapping enzymes, but 
such activity was hitherto unknown in prokaryotes”. Since 5’ modi- 
fication with monophosphate was recently shown to modulate RNase- 
E-mediated degradation of sRNA targets”, it is possible that the NAD 
cap influences not only the turnover, but also the function, of NAD-RNAs. 

In addition to stabilization and specific degradation, NAD capping 
could serve to recruit specific proteins to the RNA 5’-end, as NAD is 
among the most common protein ligands, with well-characterized bind- 
ing pockets in many proteins”*. This could endow the RNAs or associated 
molecular machines (for example, the degradosome”’) with additional 
structural, catalytic or regulatory properties. As many NAD-binding 
proteins distinguish between reduced and oxidized NAD”, associated 
proteins may differ depending on redox state. Besides the identification 
of further players in NAD-RNA processing, future work will focus on 
the mechanism of NAD capping and different potential functions of 
the NAD cap to reveal the scope of NAD capping in prokaryotic biology. 


Online Content Methods, along with any additional Extended Data display items 
and Source Data, are available in the online version of the paper; references unique 
to these sections appear only in the online paper. 
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METHODS 


General. Reagents were purchased from Sigma-Aldrich and used without further 
purification. Oligonucleotides were purchased from Biomers.net GmbH and Inte- 
grated DNA Technologies. MS experiments were performed on a Bruker micro- 
TOFQII ESI mass spectrometer. Recorded MS spectra were deconvoluted using 
maximum entropy deconvolution. For high-resolution mass spectra, internal cali- 
bration was performed (enhanced quadratic mode) using ESI Tune-mix (Fluka) as 
calibrant. Reversed-phase HPLC purification was performed on an Agilent 1100 
Series HPLC system equipped with a diode array detector using Phenomenex Luna 
5 µm C18 110 Å (250 × 4.60 mm) at a flow rate of 1 ml min⁻¹ and eluting with a
gradient of 100 mM triethylammonium acetate pH 7.0 (buffer A) and 100 mM 
triethylammonium acetate in 80% acetonitrile (buffer B). Denaturing polyacryla- 
mide gels were stained with SYBR Gold and visualized by a Typhoon 9400 imaging 
system. 

Reaction of A. californica ADPRC with NAD and various alcohols. Reactions 
were performed on a 30 µl scale in 50 mM HEPES buffer (pH 7) containing 5 mM
MgCl₂ at 37 °C for 30 min. NAD (10 mM) was mixed with ADPRC (Sigma-Aldrich,
1.4 µM) and 2 µl of 2-propyn-1-ol, 3-butyn-1-ol, 4-pentyn-1-ol or 5-hexyn-1-ol.
After the reaction, 2 µl of the reaction mixtures were diluted, filtered and analysed
by HPLC. The products were isolated and characterized by electrospray ionization 
(ESI)-MS and high-resolution MS analysis. 

CuAAC. The product of the ADPRC reaction with NAD (10 µl) was isolated by
HPLC, freeze-dried, mixed with biotin-PEG3-azide (Jena Bioscience; three
equivalents, 10 mM), and a freshly prepared mixture of CuSO₄ (5 mM) and sodium
ascorbate (10 mM) in 30 µl of 50 mM HEPES buffer (pH 7) containing 5 mM MgCl₂,
and incubated at 37 °C for 30 min. The products were isolated by HPLC and char- 
acterized by ESI-MS and high-resolution MS analysis. 

Preparation of model NAD-RNA by in vitro transcription. For in vitro
transcription we used a modification of a previously described method": 0.2 µM of
double-stranded DNA (dsDNA) template in transcription buffer (40 mM Tris, 1 mM
spermidine, 22 mM MgCl₂, 0.01% Triton-X-100, 5 mM dithiothreitol (DTT),
5% DMSO, pH 8.1), along with 1 mM of each of GTP, ATP, CTP and UTP, with
4 mM NAD (NAD-RNA) or without NAD (unmodified RNA) was mixed in an
Eppendorf tube. Then, 5.2 U µl⁻¹ of T7 RNA polymerase was added
(laboratory-prepared stock) and the transcription mixture was incubated at 37 °C for 4 h. The
reaction was stopped by addition of gel loading buffer (10% TBE in formamide
containing xylene cyanol and bromophenol blue) and purified over a 12%
denaturing polyacrylamide gel using standard electrophoresis conditions (1× TBE buffer,
run at 25 W for 3h). Transcription products were excised and eluted in 0.3 M 
sodium acetate (pH 5.5) overnight at room temperature. The eluted solution was 
ethanol precipitated and then dissolved in Millipore water. 

Application of the capture protocol to model NAD-RNA. The transcribed RNA
(NAD-RNA or non-modified RNA, 0.35 nmol) was mixed with 3 µl of 4-pentyn-1-ol
and ADPRC (0.4 µM) in 60 µl of 50 mM HEPES and 5 mM MgCl₂ (pH 7) and
incubated at 37 °C for 30 min. The RNA was purified by ethanol precipitation,
one-third was kept for PAGE analysis and two-thirds were used in the subsequent
CuAAC reaction. Biotin-PEG3-azide (29 µM, calculated as 10 equivalents to RNA)
and a freshly prepared mixture of CuSO₄ (1.25 mM),
tris(3-hydroxypropyltriazolylmethyl)amine (THPTA; 0.625 mM) and sodium ascorbate
(2.5 mM) were mixed with purified RNA in 80 µl of 50 mM HEPES and 5 mM MgCl₂
(pH 7) and incubated at 25 °C for 30 min. Half of the sample was then mixed with streptavidin
solution (New England Biolabs) at a ratio of 1:1 to added biotin. The samples were 
mixed with gel loading buffer (10% TBE in formamide containing xylene cyanol 
and bromophenol blue) and analysed by electrophoresis on a 16% denaturing 
polyacrylamide gel. 
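
The azide concentration quoted above follows from the amounts given in this paragraph. As a small worked check, assuming complete recovery of RNA after ethanol precipitation (an assumption, since recovery is not stated), two-thirds of 0.35 nmol RNA in 80 µl with 10 equivalents of azide gives about 29 µM. The function name is hypothetical.

```python
def azide_concentration_um(rna_nmol_total, fraction_used, equivalents, volume_ul):
    """Concentration (µM) of a reagent added at a given molar excess over RNA.

    Assumes no RNA is lost between the previous step and this reaction.
    """
    rna_nmol = rna_nmol_total * fraction_used
    reagent_nmol = rna_nmol * equivalents
    # nmol per µl equals mmol per litre (mM); multiply by 1000 for µM.
    return reagent_nmol / volume_ul * 1000.0


conc = azide_concentration_um(rna_nmol_total=0.35, fraction_used=2 / 3,
                              equivalents=10, volume_ul=80)
print(round(conc, 1))
```

The result rounds to the 29 µM stated for biotin-PEG3-azide.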

Plasmids and E. coli strains. The nudC gene was PCR amplified from genomic 
DNA of E. coli K-12 (isolated via GenElute Bacterial Genomic DNA Kit, Sigma-Aldrich).
XbaI and NotI restriction sites were introduced during amplification using
the primers Fwd NudC and Rev NudC (Supplementary Table 1). The resulting PCR
product was digested with XbaI and NotI and cloned into pET-28c vector (Merck
Millipore). After Sanger sequencing, the resulting plasmid pET-28c-NudC was trans- 
formed into E. coli One Shot BL21 Star (DE3) (Life Technologies). The NudC(E178Q) 
mutant was generated by site-directed mutagenesis using a procedure based on the 
Phusion Site-Directed Mutagenesis Kit (Thermo Scientific). The resulting plasmid 
pET-28c-NudC(E178Q) was sequenced and transformed into E. coli One Shot 
BL21 Star (DE3). 

For complementation of the nudC knockout in E. coli JW5548-1 (ref. 31) (E. coli 
Genetic Stock Center (CGSC)), plasmids pTACNudC and pTACNudC(E178Q) 
that carry nudC or nudC E178Q were prepared. nudC or nudC E178Q were ampli- 
fied from plasmid pET-28c-NudC or pET-28c-NudC(E178Q). EcoRI and SphI 
restriction sites were introduced during amplification and the PCR product cloned 
into the pTAC-MAT-2 vector (Sigma). Sequenced plasmids were transformed into 
E. coli JW5548-1. 


To create analogous strains that express RNAI, plasmid pHSG 398 (TaKaRa) 
was subjected to site-directed mutagenesis to obtain the promoter contained in the 
pUC19 plasmid. pHSG 398-G_,-A was then transformed into E. coli BW25113 
(wild type) (CGSC), E. coli JW5548-1 (ΔnudC) and E. coli JW5548-1 containing
pTACNudC (ΔnudC + NudC) or pTACNudC(E178Q) (ΔnudC + NudC(E178Q)).
All strains used and generated in this work are summarized in Extended Data Fig. 7.

Bacterial RNA isolation. Total RNA from E. coli K-12 (strain DSM 498; DSMZ),
E. coli JM109 (F+ strain; Promega) transformed with pUC19 plasmid, E. coli
BW25113, E. coli JW5548-1 (ΔnudC) and all strains derived thereof was isolated
using RNAzol (Molecular Research Center). Bacteria were grown at 37 °C in LB
broth until late exponential phase (OD600 nm = 2.0). Expression of NudC in strains
containing pTACNudC or pTACNudC(E178Q) was induced with 1 mM
isopropyl-β-D-thiogalactoside (IPTG) at OD600 nm = 0.8. To prevent RNA degradation, 40 ml
of culture were mixed with 5 ml ice-cold EtOH/phenol stop solution (5%
water-saturated phenol, pH < 7.0), and bacteria were harvested by centrifugation at
1,800g at 4 °C for 20 min. The pellet was resuspended in 600 µl lysozyme solution
(2 mg ml⁻¹ in water) and incubated for 5 min at 37 °C. RNAzol (6 ml) was added
to the lysate to isolate total RNA according to the supplier’s information. In
addition, total RNA was digested with DNase I (Thermo Scientific) and proteinase K
(Sigma-Aldrich). DNase I (0.5 U µg⁻¹ of total RNA) digestion was performed at
37 °C for 30 min and stopped by the addition of proteinase K (100 µg ml⁻¹), 0.5%
(w/v) SDS and 5 mM CaCl₂ at 37 °C for 15 min. Finally, total RNA was
phenol/chloroform extracted twice and ethanol precipitated.

Capture protocol for NAD-RNA from total RNA. Three samples of total RNA
(100 µg) were treated with 4-pentyn-1-ol (10 µl, minus ADPRC control) or ADPRC
(0.85 µM, minus 4-pentyn-1-ol control), or with both 4-pentyn-1-ol and ADPRC
(fully treated sample), in 100 µl of 50 mM HEPES and 5 mM MgCl₂ (pH 7) and
incubated at 37 °C for 30 min. The reaction was stopped by phenol/ether
extraction and RNA was ethanol precipitated. The subsequent CuAAC reaction was
performed for all samples on a 100 µl scale in 50 mM HEPES and 5 mM MgCl₂ (pH 7)
with biotin-PEG3-azide (250 µM) and a freshly prepared mixture of CuSO₄ (1 mM),
THPTA (0.5 mM) and sodium ascorbate (2 mM) for 30 min at 25 °C. Samples were
purified by phenol/ether extraction and ethanol precipitated. Mobicol Classic
columns were assembled with small filters (MoBiTec GmbH) and 50 µl of
streptavidin immobilized on agarose (CL-4B, Sigma-Aldrich) for each sample and were
washed three times with 100 µl of immobilization buffer (10 mM HEPES, 1 M NaCl,
5 mM EDTA, pH 7.2). All further washes were performed three times, with the same
volume, except for washes with streptavidin washing buffer (five times). The beads
were then blocked with 100 µl solution of 100 µg ml⁻¹ BSA (New England Biolabs)
in immobilization buffer at room temperature for 20 min at 1,000 r.p.m. After
washing the beads, each RNA sample, dissolved in 40 µl immobilization buffer, was
incubated with beads at room temperature at 1,000 r.p.m. for 60 min. The beads were
washed with streptavidin washing buffer (50 mM Tris-HCl, 8 M urea, pH 7.4) and
with 1× ligation buffer (50 mM Tris-HCl, 10 mM MgCl₂, 15% dimethylsulphoxide
(DMSO), 50 µg ml⁻¹ BSA, pH 7.4). The beads were again blocked as described earlier.

Ligation and amplification protocol. The beads were covered with 20 µl of ligation
mixture and incubated at 4 °C for 16 h. The ligation mixture contained adenylated
3'-adaptor*? (5 µM, AdaptorA and AdaptorB; desalted), 50 mM 2-mercaptoethanol,
T4 RNA ligase (0.5 U µl⁻¹; Thermo Scientific), T4 RNA ligase 2, truncated (10 U µl⁻¹;
New England Biolabs) in ligation buffer. The beads were then incubated with 10 µl
of 5 M NaCl for 1 h at room temperature at 1,000 r.p.m. (rebinding to streptavidin)
and washed first with streptavidin washing buffer, then with 1× first strand (FS)
buffer (Life Technologies) and again blocked in 100 µl solution of 100 µg ml⁻¹ BSA
in 1× FS buffer, as described earlier. The beads were then covered with 20 µl of
reverse transcription mixture in 1× FS buffer containing 0.5 mM dNTPs, BSA
(50 µg ml⁻¹), 5 mM DTT and reverse primer (5 µM, RTprimerA or RTprimerB),
shaking at 1,000 r.p.m. for 15 min at room temperature. Then SuperScript II Reverse
Transcriptase (10 U µl⁻¹; Life Technologies) was added and the mixture was
incubated at 37 °C without shaking for 60 min. The rebinding procedure was repeated
(+10 µl of 5 M NaCl for 1 h at room temperature shaking at 1,000 r.p.m.) and the
beads were washed with streptavidin washing buffer and with 1× exonuclease I
(Exo I) buffer (Thermo Scientific). The beads were again blocked with 100 µl
solution of 100 µg ml⁻¹ BSA in 1× Exo I buffer (Thermo Scientific) as described earlier,
and again washed with 1× Exo I buffer. An exonuclease I digest was performed to
remove free single-stranded DNA (ssDNA) primers. Twenty microlitres of 1× Exo
I buffer with exonuclease I (1 U µl⁻¹; Thermo Scientific) were incubated with beads
for 30 min at 37 °C. After digestion, the beads were again washed with streptavidin
washing buffer and with immobilization buffer. In the next step the RNA was
digested in 100 µl of 0.15 M NaOH solution without shaking for 25 min at 55 °C.
The spin columns were centrifuged and washed with 100 µl of H₂O. The
flow-through was ethanol precipitated. The obtained cDNA was dissolved in CTP
tailing mixture containing 1.25 mM CTP, 1× TdT buffer and terminal
deoxynucleotidyl transferase (1 U µl⁻¹; Thermo Scientific) in a volume of 20 µl, and incubated
for 30 min at 37 °C. The enzyme was thermally denatured by heating to 70 °C for
10 min. A 20 µl mixture for the ligation of the 2nd adaptor (containing 100 mM
Tris-HCl, 20 mM MgCl₂, anchor (5 µM, AnchorA annealed with AnchorB), ATP
(10 µM) and T4 DNA ligase (1.5 Weiss U µl⁻¹; Thermo Scientific)) was added and
the solution was incubated for 16 h at 4 °C. The tailed and ligated cDNA was used
further in all experiments.

NGS. cDNA (fully treated samples, minus ADPRC controls) was PCR amplified. 
The regions corresponding to 120-160 bp for sets 1 and 2, or 50-300 bp for set 3 of 
both minus ADPRC control (A) and fully treated sample (B), were extracted from 
a 2% agarose gel, purified and either first amplified by PCR, or directly barcoded in 
a PCR with barcoding primers (Supplementary Table 1). Multiplexed samples 
were submitted to the CellNetworks Deep Sequencing Core facility for library
synthesis using the gDNA kit (New England Biolabs) with TruSeq adaptors from the
TruSeq RNA prep kit (Illumina) and sequencing on a HiSeq2000 using the 100 bp
single end protocol. 

Analysis of NGS data. Using the program RobiNA”’, the barcoded sequencing 
reads of different samples and controls were demultiplexed. Furthermore, forward 
and reverse reads were separated according to the adaptor sequence present at the 
5’-end of each read. Hereby, we were able to recognize whether a sequencing read 
was in the original orientation (that is, as on the RNA level) and differentiate between 
reads that belonged to the 5’- and 3'-ends of those original RNA sequences. This 
analysis enabled the use of strand-specific feature extraction methods later on in 
the workflow. During the process, we clipped the barcode and adaptor sequences. 
We performed all further sequence analysis using the web-based sequence-analysis 
platform GeneProf™*. To this end, the individual barcode- and orientation-separated 
data sets were uploaded to the platform and the FASTX-Toolkit (http://hannonlab. 
cshl.edu/fastx_toolkit/): Reverse Complement was used to revert sequences that 
were in reverse orientation, representing RNA 3'-termini. Furthermore, using the 
Modify and Filter Sequence tool, we filtered reads according to quality, and removed 
the additional nucleotides that were added to the cDNA sequence during the tail- 
ing reaction, which are thus present as starting Gs at the 5’-termini of forward 
reads, and, for data set 3, removed adaptor sequences that were present at the 3’- 
end of certain reads (due to the fact that some PCR products were shorter than the 
100 bp read length). A custom reference set for a pUC19-containing E. coli strain 
K-12 was created using reference data from Ensembl” Bacteria FTP (ASM584v1.16) 
in which the F-plasmid contained in the JM109 strain was included. We inserted 
the pUC19 plasmid sequence (GenBank accession L09137.2) manually. For feature 
annotation, we used the Ensembl BioMart utility.
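
The read preparation steps described in this paragraph (separating forward and reverse reads by their 5'-adaptor, clipping the adaptor, reverse-complementing reverse reads and removing the leading Gs that stem from the tailing reaction) can be sketched as follows. This is an illustration only and not the RobiNA, GeneProf or FASTX-Toolkit implementation. The two adaptor sequences are hypothetical placeholders, and stripping every leading G is a simplifying assumption, because the text does not state how many nucleotides were removed.

```python
COMPLEMENT = str.maketrans("ACGTN", "TGCAN")

# Hypothetical placeholder adaptors; the real sequences are not given here.
FWD_ADAPTOR = "ACGTACGT"
REV_ADAPTOR = "TTGGCCAA"


def reverse_complement(seq):
    """Reverse complement of a DNA sequence (A, C, G, T, N only)."""
    return seq.translate(COMPLEMENT)[::-1]


def preprocess(read):
    """Return (orientation, cleaned sequence), or None if no adaptor matches."""
    if read.startswith(FWD_ADAPTOR):
        insert = read[len(FWD_ADAPTOR):]
        # Leading Gs come from the C-tail added to the cDNA 3'-end.
        # Caveat: this also removes any genuine 5'-terminal G of the RNA.
        return "fwd", insert.lstrip("G")
    if read.startswith(REV_ADAPTOR):
        insert = read[len(REV_ADAPTOR):]
        # Reverse reads represent RNA 3'-termini in antisense orientation.
        return "rev", reverse_complement(insert)
    return None


print(preprocess("ACGTACGTGGGATTC"))
print(preprocess("TTGGCCAAGAATC"))
```

Keeping the orientation label with each cleaned read is what allows strand-specific feature extraction in later steps.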

We aligned all modified forward and reverse reads to the reference data set using 
the Bowtie alignment tool and performed further analysis using the Quantitate 
Gene Expression tool. For each sample or control, average read numbers as aver- 
age of normalized read counts (in RPM) from forward and reverse reads were cal- 
culated. Enrichment factors were determined as ratio of average read counts of 
sample B versus control A (Supplementary Data 1-3). For each data set, relative 
abundance of different reads in sample B was plotted against relative abundance in 
control A. This analysis revealed that the most abundant reads were clearly sepa- 
rated into an enriched and a non-enriched sequence population. Abundance thresh- 
olds T(data set), above which this separation was possible, were determined for each 
data set separately, as T1 = 1,000 and T2 = T3 = 200. The most abundant sequences 
were identified as those where the sum of reads in A and B ≥ T(data set). Candidate 
sequences were inspected in the Integrated Genome Browser (IGB) and visua- 
lized using the GeneProf genome browser. Hits were compared to EcoCyc anno- 
tations for transcription site annotation and further reference. 
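
As a worked illustration of the quantities defined above, the following Python sketch computes RPM-normalized averages of forward and reverse counts, the enrichment factor B/A, and the abundance filter. It is a minimal sketch, not the GeneProf workflow: the function names are hypothetical, and the filter assumes that the threshold is applied as A + B ≥ T.

```python
import math

# Illustrative sketch only; function names are hypothetical.


def rpm(count: int, total_reads: int) -> float:
    """Normalize a raw read count to reads per million (RPM)."""
    return count * 1_000_000 / total_reads


def average_rpm(fwd_count: int, rev_count: int,
                fwd_total: int, rev_total: int) -> float:
    """Average of the normalized forward and reverse read counts."""
    return (rpm(fwd_count, fwd_total) + rpm(rev_count, rev_total)) / 2


def enrichment(sample_b: float, control_a: float) -> float:
    """Enrichment factor: average reads in sample B over control A."""
    if control_a == 0:
        return math.inf  # no reads in the control; the ratio is unbounded
    return sample_b / control_a


def abundant_enrichments(features: dict, threshold: float) -> dict:
    """Keep features whose summed abundance reaches the threshold.

    `features` maps a name to (control A, sample B) average RPM values.
    Assumption: the cut-off is applied as A + B >= threshold.
    """
    return {
        name: enrichment(b, a)
        for name, (a, b) in features.items()
        if a + b >= threshold
    }
```

For example, `abundant_enrichments({"x": (100.0, 900.0), "y": (50.0, 60.0)}, 1000)` keeps only the made-up feature `x`, with an enrichment factor of 9.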

Real-time PCR. Quantitative real-time PCR (qPCR) was used to validate the enrich- 
ment of sequences identified in the deep sequencing data. All reactions were per- 
formed in duplicate in 20 µl scale. Three microlitres cDNA (diluted 1:40 for sRNA 
and 1:50 for mRNAs) was used as a template for qPCR, which was performed in a 
LightCycler 480 instrument (Roche) using the Brilliant III Ultra-Fast SYBR Green 
qPCR Mastermix (Agilent). Analysis of the data was performed with the Light- 
Cycler 480 Software. Fully treated sample cDNA experiments were compared 
with the same experiments performed with minus ADPRC control cDNA, and 
Millipore water was used as a negative control. The primers were designed accord- 
ing to results from deep sequencing analysis for specific target sequences (Sup- 
plementary Table 1). 

Pull-down of RNAI, GcvB and 5S rRNA. For analysis of endogenous RNAs we 
used a modification of a previously described method. Total RNA prepared as 
described earlier (for LC-MS of RNAI from E. coli JM109, OD600 nm = 2.0; for LC- 
MS of GcvB from E. coli K-12, OD600 nm = 1.0) was size-separated by polyacryla- 
mide gel (8%) electrophoresis, and only the region containing RNA of sizes between 
75 and 200 nucleotides was eluted and ethanol precipitated. To streptavidin agarose 
slurry (100 µl, Sigma-Aldrich), washed 2× with 1× PBS buffer (100 µl), biotiny- 
lated Hybri-probes (5 nmol) (Supplementary Table 1) were added and incubated 
at 25 °C, 500 r.p.m. for 10 min. The slurry was washed again two times with 1× PBS 
buffer (100 µl), equilibrated two times with pull-down buffer (100 µl, 10 mM Tris- 
HCl, 0.9 M tetramethylammonium chloride, 0.1 M EDTA, pH 7.8), incubated with 
250 µg of RNA (75-200 nucleotides) in pull-down buffer at 65 °C for 10 min and 
then cooled down to room temperature over 20 min. The slurry was then transferred 
to Mobicol Classic columns and washed six times with 10 mM Tris-HCl (100 µl, 
pH 7). The hybridized RNA was released by suspending the slurry in 100 µl of 
Millipore water and heating the slurry four times to 75 °C for 10 min. After ethanol 
precipitation, the RNA was further purified by polyacrylamide gel (10%) electrophoresis. 

DNase I digestion. RNAI, purified by pull-down, was further treated with DNase 
I to digest biotinylated hybridization probe. The ethanol-precipitated pull-down 
RNA (equivalent to 100 µg of 75-200-nucleotide RNA fraction) was dissolved in 
20 µl of DNase I buffer containing 0.25 U µl⁻¹ of DNase I. The reaction mixture 
was incubated at 37 °C for 15 min and the enzyme was thermally denatured by 
heating to 65 °C for 10 min. RNA was ethanol precipitated before PAGE analysis. 

Nuclease P1 digestion. RNAI, GcvB or 5S rRNA pull-down from total RNA was 
digested by nuclease P1 (0.1 U µl⁻¹; Sigma-Aldrich) in 100 µl of 50 mM ammonium 
acetate buffer (pH 4.5) at 37 °C for 1 h. The digest was purified over Amicon- 
Millipore filters 10 kDa (Merck Millipore). The flow-through was concentrated on 
a Speedvac system to ~20 µl for LC-MS analysis. The nucleotides subjected to LC- 
MS measurements in positive mode were dephosphorylated with FastAP (Thermo 
Scientific; 1 U pmol⁻¹ RNA, 1 h at 37 °C) and again purified over Amicon-Millipore 
filters 3 kDa (Merck Millipore). 

HPLC purification of dephosphorylated digest for LC-MS in positive mode. 
HPLC purification was performed on an Agilent 1200 Series HPLC system equipped 
with a multi-wavelength detector using a Phenomenex Synergi Fusion-RP column 
(4 µm, 4.6 × 150 mm) at a flow rate of 0.35 ml min⁻¹ and eluting with a gradient 
of 5 mM ammonium acetate pH 8.3/acetonitrile. Samples were collected at 17.5- 
18.5 min and applied to HPLC-MS analysis. 

HPLC-MS analysis. HPLC-MS experiments were performed on a Bruker 
microTOFQ-II ESI mass spectrometer connected to an Agilent 1200 Series HPLC 
system equipped with a multi-wavelength detector. A Phenomenex Kinetex C18 
column (2.6 µm, 2.1 × 100 mm) was used at a flow rate of 0.2 ml min⁻¹ using a gra- 
dient of 0.1% aqueous ammonium formate/methanol (LC-MS grade) for nuclease- 
P1-treated RNA or 100 mM hexafluoroisopropanol/8.6 mM triethylamine pH 8.3 
and methanol (LC-MS grade) for the dephosphorylated digest. ESI was used with a 
capillary voltage of 4,500 V, end plate offset −500 V and collision voltage of 10 eV. 
The drying gas temperature was 190 °C, the drying gas flow rate was 8 l min⁻¹ and 
the detector was operated in negative ion mode (RNAI, 5S RNA) or positive ion 
mode (GcvB, 5S RNA). Analysis of the LC-MS measurements was carried out 
using Hyphenation Star PP (Version 3.2.44.0) and DataAnalysis (version 4.0, SP 4) 
software (Bruker Daltonics). 

Overexpression and affinity purification of RNase E(1-529), NudC and 
NudC(E178Q). The RNase E catalytic domain (residues 1-529) (supplied by 
B. Luisi) was overexpressed and purified as previously described. IPTG-induced 
E. coli One Shot BL21 Star (DE3) containing the respective plasmid were harvested 
and lysed by sonication (30 s, 50% power, five times) in HisTrap buffer A (50 mM 
Tris-HCl pH 7.8, 1 M NaCl, 5 mM MgSO4, 5 mM 2-mercaptoethanol, 5% glycerol, 
5 mM imidazole, 1 tablet per 500 ml complete EDTA-free protease inhibitor cock- 
tail (Roche)). The lysate was clarified by centrifugation (37,500g, 30 min, 4 °C) and 
the supernatant was applied to a Ni-NTA HisTrap column (GE Healthcare). The 
protein was eluted with a gradient of HisTrap buffer B (HisTrap buffer A with 
500 mM imidazole) and analysed by SDS-PAGE. Fractions of interest were con- 
centrated in a micro filter tube (cut-off 10 kDa, centrifugation at 2,000 r.p.m., 4 °C) 
and stored in 50 mM Tris-HCl, pH 7.9, 0.5 M NaCl, 50 mM KCl, 1 mM MgCl2, 
5 mM DTT, 50% (v/v) glycerol, 1 tablet per 500 ml complete EDTA-free protease 
inhibitor cocktail. 

E. coli One Shot BL21 Star (DE3) containing pET28c-NudC or pET28c- 
NudC(E178Q), E. coli JW5548-1 containing pTACNudC or E. coli JW5548-1 containing 
pTACNudC(E178Q) were induced with 1 mM IPTG for 3 h. The harvested cells 
were lysed by sonication and the His-tagged enzymes were purified similarly to 
RNase E. All purified protein samples were >95% pure, judging from SDS-PAGE. 

Preparation of different 5’-terminally labelled RNAs for in vitro cleavage assays. 
Specifically 5’-terminally radiolabelled RNAI, containing either 5'-monophosphate, 
5'-triphosphate, 5’-NAD or 5'-NADH, was obtained by in vitro transcription. As a 
template for in vitro transcription, the sequence of RNAI was amplified from the 
pUC19 plasmid using the primer pair Fwd T7-φ2.5 RNAI and Rev RNAI (Sup- 
plementary Table 1). RNA was transcribed in 200 µl scale in the presence of 40 mM 
Tris pH 8.1, 1 mM spermidine, 22 mM MgCl2, 0.01% Triton-X-100, 10 mM DTT, 
5% DMSO, 200 nM DNA template, 0.1 mg ml⁻¹ T7 RNA polymerase (lab-prepared 
stock), 4 mM of CTP, GTP and UTP and 2 mM of ATP. For transcription of mono- 
phosphorylated RNAI, the transcription reaction was doped with 200 µCi of radio- 
active AMP (³²P α-AMP, 6,000 Ci mmol⁻¹; Hartmann Analytic). For preparation 
of triphosphate-RNAI, 200 µCi of radioactive ATP (³²P γ-ATP, 3,000 Ci mmol⁻¹; 
Hartmann Analytic) and for NAD-RNAI 125 µCi of radioactive NAD (adenylate-³²P, 
800 Ci mmol⁻¹; Perkin Elmer) were used. For transcription of NADH-RNAI, ³²P- 
NAD (1.56 µM in 8 mM Tris buffer pH 9.2) was reduced to ³²P-NADH by non- 
enzymatic hydrogen transfer in the presence of 50 mM NADH. The formation 
of 1,4-NADH was verified by a lactate dehydrogenase assay. ³²P-NADH was 
purified by HPLC. A Phenomenex Kinetex C18 column (2.6 µm, 2.1 × 100 mm) 
was used and eluted at a flow rate of 1 ml min⁻¹ with 0.1 M KH2PO4, 8 mM tetra- 
n-butylammonium hydrogensulphate, pH 6.0, with 0-15% (v/v) acetonitrile. 
NADH was desalted over a cooled Sep-Pak C18 Cartridge (10 g, 55-105 µm; 
Waters) and added to in vitro transcription reactions. Transcription reactions 
were incubated at 37 °C for 4 h and purified on 10% denaturing polyacrylamide 
gels. Radioactively labelled RNA was eluted from gel slices overnight and isopro- 
panol precipitated. RNA concentrations were determined using the Qubit 2.0 Fluo- 
rometer (Life Technologies). 

RNase E, RppH and NudC cleavage assays. In vitro RNase E and NudC cleavage 
assays of in vitro transcribed monophosphorylated-, triphosphorylated-, NAD- or 
NADH-RNAI were performed in the presence of 25 mM Tris pH 7.5, 50 mM 
NaCl, 50 mM KCl, 10 mM MgCl2, 1 mM DTT, 5 µM RNA and 1.34 nM RNase 
E or 4.5 nM NudC/NudC(E178Q). RppH reactions were carried out in the presence 
of 1× NEBuffer 2 (New England Biolabs), 5 µM RNA, 0.5 U µl⁻¹ RppH (New 
England Biolabs). NAD-RNAI and NADH-RNAI cleavage reactions were always 
performed in the presence of 0.1 U µl⁻¹ FastAP thermosensitive alkaline phos- 
phatase (Thermo Scientific) in order to remove the ³²P-phosphate that became 
accessible only after cleavage of the NAD(H) pyrophosphate bond. 

Sequential decapping/digests of 5’-NAD-RNAI or 5’-triphosphate-RNA by NudC/ 
RNase E or RppH/RNase E were carried out as follows: 2.5 µM RNA (³²P α-CTP 
labelled triphosphorylated RNAI or ³²P-NAD-RNAI) was treated with NudC or 
RppH for 20 min at 37 °C, using the conditions described earlier. RNA was sepa- 
rated from enzymes by phenol/chloroform/ether extraction and isopropanol pre- 
cipitated. RNA pellets were dissolved in water and digested with RNase E using 
standard conditions described earlier. All time course reactions were performed at 
37 °C and reactions were stopped by direct addition of one volume loading dye 
(10% TBE in formamide containing xylene cyanol and bromophenol blue). Sam- 
ples were loaded onto 18% denaturing polyacrylamide sequencing gels under stan- 
dard electrophoresis conditions (1× TBE buffer, run at 45 W for 3 h). Gels were 
visualized using storage phosphor screens (GE Healthcare) and a Typhoon 9400 
imager (GE Healthcare) and quantified using ImageQuant software (Molecular 
Dynamics). Apparent rate constants (k) were determined using the ‘linear quick 
fit’ tool in OriginPro 8 software (OriginLab). 

Quantification of NAD-RNA fractions. The NAD-RNA fraction was quantified 
in an adapted version of the PABLO assay. In the first step, total RNA was 
dephosphorylated using 0.2 U µg⁻¹ total RNA FastAP thermosensitive alkaline 
phosphatase (Thermo Scientific). The reaction was performed on a 400 µl scale 
in the presence of 100 µg total RNA, 1× FastAP buffer (Thermo Scientific) for 
120 min at 37 °C. RNA was phenol/ether extracted and isopropanol precipitated. 
Twenty micrograms of the dephosphorylated RNA were incubated with 2 nmol 
NudC or NudC(E178Q) for 120 min at 37 °C to convert 5’-NAD-capped RNA to 
5'-monophosphate RNA (200 µl scale, using NEBuffer 3). Remaining RNA was 
isopropanol precipitated and redissolved in water. For ligation, 10 µg of the NudC- or 
NudC(E178Q)-treated total RNA were combined with 5 µM adaptor (Quant- 
Adaptor; Supplementary Table 1) and 5 µM of the respective splint (Splint-FL- 
RNAI, Splint-RNAI₋₅ or Splint-6S; Supplementary Table 1) in a final volume of 
12 µl, heated to 90 °C for 30 s, and then chilled at room temperature for 5 min. The 
ligation reaction was incubated for 4 h at 37 °C after addition of 50 mM Tris-HCl, 
pH 7.5, 0.5 mM DTT, 50 µM ATP, 10 mM MgCl2 and 60 Weiss U T4 DNA ligase 
(Thermo Scientific). RNA was isopropanol precipitated, and analysed by north- 
ern blotting with target-specific probes for RNAI or 6S RNA. Ligation yields were 
calculated from the ratio of ligated RNA to ligated plus unligated RNA. Mean 
values and standard deviations were calculated from yields measured in three inde- 
pendent experiments. Ligation efficiencies were calculated for each individual sample 
by spiking 1 pmol in vitro transcribed NAD-³²P-RNAI or NAD-³²P-6S RNA into 
the respective total RNA sample and treating the mixture with FastAP, NudC (or 
NudC(E178Q)) and T4 DNA ligase, as described earlier. The major processing 
product of RNAI, RNAI₋₅, which is generated by RNase E cleavage and thus bears 
a monophosphate, was not ligated in our NAD quantification assay, as expected. 
Therefore, to calculate its abundance, in order to estimate the percentage of full- 
length RNAI, we used a method very similar to the original PABLO assay, by 
performing ligation as described earlier, without prior dephosphorylation or NudC 
treatment. For calculation of the ligation efficiency of RNAI₋₅, ³²P-RNAI₋₅ was 
spiked into total RNA and specifically ligated to the adaptor. For northern blot 
analysis, RNA was separated on 10% denaturing polyacrylamide gels (7 W, 65 min), 
transferred by electroblotting to Whatman Nytran SuPerCharge nylon blotting 
membrane (GE Healthcare) for 2.5 h at 250 mA and UV crosslinked. Membranes 
were prehybridized in 15 ml ULTRAhyb Ultrasensitive Hybridization Buffer (Life 
Technologies) for 2 h at 48 °C. Target-specific RNA riboprobes (Supplementary 
Table 1) were prepared by in vitro transcription as described earlier, incorporating 
50 µCi of radioactive CTP (³²P α-CTP, 3,000 Ci mmol⁻¹; Hartmann Analytic) and 
50 µCi of radioactive ATP (³²P α-ATP, 3,000 Ci mmol⁻¹; Hartmann Analytic). 
The in vitro transcription product was DNase I (Thermo Scientific) digested and 
phenol/chloroform extracted. RNA was isopropanol precipitated and the pellet 
dissolved in 50 µl water. Ten microlitres of the riboprobe were added to the pre- 
hybridized membrane and incubated overnight at 48 °C. The blot was washed two 
times for 5 min with wash solution 1 (2× SSC, 0.1% SDS) and two times for 15 min 
with wash solution 2 (0.25× SSC, 0.1% SDS). Results were visualized and quan- 
tified as described earlier. For preparation of radioactive RNA size standards, RNA 
Century Marker Templates (Life Technologies) were in vitro transcribed using T7 
RNA polymerase in the presence of 40 µCi ³²P α-CTP. 
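
The ratiometric quantification described in this section can be illustrated with a short Python sketch. The yield formula follows the text (ligated over ligated plus unligated); the normalization step is an assumption that the measured yield is simply divided by the spike-in efficiency, and the use of the sample standard deviation is likewise an assumption. All function names are hypothetical.

```python
from statistics import mean, stdev

# Illustrative sketch only; function names are hypothetical.


def ligation_yield(ligated: float, unligated: float) -> float:
    """Ratio of ligated RNA to ligated plus unligated RNA (band intensities)."""
    return ligated / (ligated + unligated)


def normalized_fraction(sample_yield: float, efficiency: float) -> float:
    """Correct a yield for the efficiency measured with the spiked-in RNA.

    Assumption: the correction is a simple division by the efficiency.
    """
    return sample_yield / efficiency


def summarize(yields: list) -> tuple:
    """Mean and sample standard deviation over independent experiments."""
    return mean(yields), stdev(yields)
```

With made-up band intensities of 30 (ligated) and 70 (unligated), `ligation_yield(30, 70)` gives 0.3; three such yields from independent experiments would then be passed to `summarize`.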

NudC processing of cellular RNAI in vitro. One-hundred micrograms total RNA 
was subjected to alkaline phosphatase FastAP to remove 5’-monophosphate- and 
5'-triphosphate termini (400 µl scale, 1× FastAP buffer, 0.2 U µg⁻¹ total RNA FastAP 
thermosensitive alkaline phosphatase for 120 min at 37 °C). RNA was phenol/ether 
extracted and isopropanol precipitated. One-hundred micrograms dephosphory- 
lated RNA was incubated with 500 pmol NudC or NudC(E178Q). Aliquots were 
taken at different time intervals, reactions were stopped by flash freezing in liquid 
nitrogen and the RNA was isopropanol precipitated. NudC processing of RNAI 
and 6S RNA was analysed by ligation and northern blotting, as described earlier. 

RNA half-life determination. Measurements of RNA half-lives were performed 
in the following strains: wild type and ΔnudC for GcvB or wild type + RNAI 
and ΔnudC + RNAI for RNAI. Bacteria from overnight cultures were diluted to 
OD600 nm = 0.1 and grown in LB medium at 37 °C. Half-lives of GcvB and RNAI at 
OD600 nm = 2.0 were determined by treating cells with rifampicin (500 µg ml⁻¹). 
Samples were taken at time intervals before and after inhibition of the transcrip- 
tion and bacteria from wild type and ΔnudC strain were treated with RNAprotect 
Bacteria Reagent (Qiagen) to prevent RNA degradation. Then, RNA was isolated 
and analysed by northern blotting, as described earlier. 
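
The text does not state how half-lives were derived from the northern blot signals. One common approach, shown here purely as an assumption, is to model decay after rifampicin addition as first order and fit a straight line to the logarithm of the signal over time; the half-life is then ln 2 divided by the decay constant. The function name and the input data are hypothetical.

```python
import math

# Illustrative sketch only; assumes first-order decay after rifampicin addition.


def half_life(times: list, signals: list) -> float:
    """Estimate a half-life by least-squares fitting ln(signal) against time.

    `times` and `signals` are paired measurements taken after transcription
    was stopped; signals must be positive. The result has the unit of `times`.
    """
    logs = [math.log(s) for s in signals]
    n = len(times)
    mean_t = sum(times) / n
    mean_y = sum(logs) / n
    sxx = sum((t - mean_t) ** 2 for t in times)
    sxy = sum((t - mean_t) * (y - mean_y) for t, y in zip(times, logs))
    slope = sxy / sxx  # equals minus the decay constant
    return math.log(2) / -slope
```

In practice the signals would first be normalized to the loading control before fitting.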
Extended Data Figure 1 | Development of the NAD capture protocol. 

a, General scheme: ADPRC-catalysed transglycosylation with different 
alcohols, followed by ‘click’ biotinylation with biotin-PEG3-azide. b, Isolated 
yields and HPLC retention times. c, HPLC chromatogram of the ADPRC- 
catalysed reaction of NAD with 4-pentyn-1-ol. d, High-resolution ESI mass 
spectrum of the transglycosylation product. e, HPLC chromatogram of the 
copper-catalysed azide-alkyne cycloaddition (CuAAC) reaction mixture with 
the product from c. f, High-resolution ESI mass spectrum of the CuAAC 
reaction product. 
Extended Data Figure 2 | NAD capture protocol applied to RNA and 
comparison of the enrichment in NGS data sets from three different NAD 
captureSeq experiments. a–c, Agarose gels of the PCR amplification products 
of cDNA obtained from the NAD capture protocol, applied to in vitro 
transcribed unmodified RNA and in vitro transcribed model NAD-RNA after 
5 (left), 10 (middle), and 15 (right) cycles (a); and total RNA from E. coli 
JM109 (b) or K-12 (c). The region (120-160 bp) around the band present in 
the JM109 sample (~150 bp, arrow) that occurred only in the fully treated 
sample but not in the controls was chosen for NGS libraries of minus ADPRC 
control A1 and fully treated sample B1, in agreement with the expected 
size (<200 nucleotides) of NAD-modified RNA according to previous 
observations. The intense band of ~150 bp size was later attributed to the 
most abundant enriched RNA, RNAI, which was absent in the K-12 sample. 
For this reason, the K-12 sample did not contain such a prominent additional 
band. For comparability, the same region (120-160 bp) was excised for 
library preparation for K-12 control A2 and sample B2. To achieve a more 
comprehensive picture and study the influence of sample preparation on data, 
a third library was prepared from JM109 (control A3, sample B3) choosing a 
broader size range (50-300 bp). d-i, Abundance of RNAs in fully treated 
sample (B) versus minus ADPRC control (A) in data set 1 (JM109, 
120-160 bp; d, e), data set 2 (K-12, 120-160 bp; f, g), and data set 3 (JM109, 
50-300 bp; h, i) found by strand-specific NGS analysis (RPM, average of 
forward and reverse read counts; Supplementary Data 1-3). Triangles, 
enriched; circles, non-enriched. 
Extended Data Figure 3 | NGS analysis of specific sequences found in 
control A1 and sample B1 in forward and reverse reads. Analysis of sRNAs, 
sRNA-like 5’-terminal fragments of mRNAs, and non-enriched tRNA. 
Normalized nucleotide counts derived from NGS reads (JM109, 120-160 bp) of 
on the E. coli K-12 genome or the respective plasmid in which the sequences are 
situated. A black bar indicates the protein-coding sequence of mRNAs, while 
non-coding RNAs are indicated by a thin black line. Plus and minus signs 
designate the orientation of genes in the genome/plasmid. 
Extended Data Figure 4 | NGS, real-time PCR, and northern blot analysis of 
RNAs enriched in NAD captureSeq. a, Enrichment of selected sequences 
calculated from NGS read numbers or real-time PCR data. Numbers in 
parentheses show enrichment of genes below abundance threshold T in the 
respective data set. N.D., not determined. b, Real-time PCR analysis of selected 
mRNAs for 5'-termini (regions found to be enriched) and regions in the middle 
of the mRNA. Data show that the 5’-termini are highly enriched, while 
middle regions are not, or are considerably less enriched. All real-time PCR 
experiments were performed in duplicate for the same cDNA, and no-template 
controls were used to assess primer dimerization. c, Northern blot analysis. 
The expected size range (~100 to 340 nucleotides for gatY, ~100 to 240 
nucleotides for hdeD) is indicated and for hdeD the fragment of expected size 
is indicated by an arrow. d, Example of an enriched 5'-terminus, gatY. The 
5’-untranslated region is indicated in grey, the coding sequence in black. Minus 
sign indicates that the gene is encoded on the minus-strand of the genome, 
thus in reverse orientation. The position of the promoter (gatYp) is indicated by 
an arrow. Regions amplified in real-time PCR are indicated by opposing 
arrows. 
Extended Data Figure 5 | Mass-spectrometric analysis of the ADPRC- 
catalysed transglycosylation reaction of NAD-RNAI from E. coli total RNA 
and 4-pentyn-1-ol, and analysis of the 5'-modification status of GcvB. 
a, b, Disappearance of the ADP-ribose peak ([M-H]⁻ m/z = 540.05, elution 
at 5-6 min) (a), and appearance of a peak at m/z = 624.10 (elution time 
2.0-2.5 min) upon ADPRC reaction (b). Samples were digested with nuclease 
P1 before mass spectrometry. c, Simulated signal pattern of the expected 
reaction product (isotopic distribution). d-f, Observed mass spectrum in the 
chromatography fractions eluting at 2-2.5 min of untreated RNAI (d), RNAI 
treated with ADPRC and 4-pentyn-1-ol and then digested by nuclease P1 (e) 
and 5S rRNA treated with ADPRC and 4-pentyn-1-ol and then digested by 
nuclease P1 (f). g, HPLC-MS extracted ion chromatograms of the peak [M]⁺ 
m/z = 664.116 of free NAD, and nuclease-P1-digested 5S rRNA and GcvB 
RNA. h, High-resolution mass spectra of the HPLC fractions eluting at 
2.2-2.4 min. 
Extended Data Figure 6 | NudC processing, 5’-dependent sensitization to 
RNase E processing by RppH or NudC, and investigation of RNA stability 
in vivo depending on NudC expression. a, b, Decapping activity on in vitro 
transcribed NAD-RNAI of wild-type NudC enzyme and the E178Q point 
mutant. Gel-electrophoretic assay, monitoring the disappearance of a site- 
specifically installed radiolabel (see Fig. 3c). Mean of a technical duplicate ± s.d. 
is shown. c, d, Successive decapping and digestion of NAD-RNAI and 
triphosphate-RNAI. Time course of RppH/RNase E or NudC/RNase E 
decapping/digests of triphosphate-RNAI (c) or NAD-RNAI (d). 
e-h, Determination of RNAI (mean of biological duplicate ± s.d.) (e, f) and 
GcvB (g, h) cellular half-life depending on NudC expression by rifampicin stop 
assay. Detection by northern blotting, tRNA as loading control. 
Name: strain; plasmid 
Strains for NAD captureSeq 
JM109: JM109; pUC19 & F-plasmid 
K-12: DSM 498; - 
Strains to assess GcvB stability 
Wild type: BW25113; - 
ΔnudC: JW5548-1; - 
Strains to assess RNAI stability and/or quantify NAD-modification of RNAI 
Wild type + RNAI (1): BW25113; pHSG 398-G_,-A, encoding RNAI 
ΔnudC + RNAI (2): JW5548-1; pHSG 398-G_,-A, encoding RNAI 
ΔnudC + NudC + RNAI (3): JW5548-1; pTACNudC, encoding wild-type NudC & pHSG 398-G_,-A, encoding RNAI 
ΔnudC + NudC(E178Q) + RNAI (4): JW5548-1; pTACNudC(E178Q), encoding dysfunctional NudC & pHSG 398-G_,-A, encoding RNAI 
Strains for expression of wild-type and dysfunctional NudC 
BL21 + NudC: One Shot BL21 Star (DE3); pET-28c-NudC, encoding wild-type NudC 
BL21 + NudC(E178Q): One Shot BL21 Star (DE3); pET-28c-NudC(E178Q), encoding dysfunctional NudC 


Extended Data Figure 7 | Determination of the 5’-modification status of 
cellular RNAs by a modified PABLO assay. a, Schematic overview of NAD 
quantification by dephosphorylation, NudC cleavage and splinted ligation. 
Total RNA is isolated from different E. coli strains: wild type + RNAI (1); 
ΔnudC + RNAI (2); ΔnudC + NudC + RNAI (3); or ΔnudC + NudC(E178Q) 
+ RNAI (4) (for strain details, see i). To remove all 5’-monophosphate and 5’- 
triphosphate termini, total RNA samples are subjected to alkaline phosphatase 
treatment. Then, to remove 5’-NAD and produce 5'-monophosphate termini, 
the dephosphorylated samples are treated with NudC (or NudC(E178Q) as 
a negative control). Finally, the resulting 5’-monophosphate RNA is ligated 
specifically to an adaptor by splinted ligation. Ligated fractions are determined 
by northern blotting. b, c, Validation of time-dependent in vitro NudC 
processing of cellular sRNAs (RNAI, 6S RNA) in total RNA from E. coli wild 
type + RNAI (1), using the assay described in a. Aliquots were taken after 
different times of NudC treatment and subjected to ligation as shown. tRNA 
is blotted as loading control. d, e, Quantification of the efficiency of NudC 
treatment and splinted ligation (processing efficiency) for NAD-full-length 
RNAI (NAD-FL-RNAI) and NAD-6S RNA. To calculate the percentage of 
NAD modification (shown in Fig. 4), it is necessary to normalize to the 
cumulative processing efficiency of NudC treatment and splinted ligation of 
NAD-RNA, which is determined experimentally for each total RNA 
preparation. To this end, in vitro transcribed radiolabelled NAD-³²P-FL-RNAI 
(d) or NAD-³²P-6S RNA (e) is spiked into total RNAs isolated from the four 
E. coli strains mentioned above. The mixture is then subjected to the treatment 
described in a. Processing efficiencies of radiolabelled RNAs are quantified 
ratiometrically after gel electrophoresis and electroblotting. f, g, Quantification 
of the percentage of RNAI₋₅. Cellular RNAI consists of a mixture of full-length 
and processed species, among them RNAI₋₅ (ref. 42), which arises from 
RNase E cleavage, thus contains a 5’-monophosphate, and lacks only five 
5'-terminal nucleotides. Both species exhibit very similar electrophoretic 
mobility. Here, we were interested only in the percentage of NAD-modified 
FL-RNAI. Thus, we quantified the percentage of RNAI₋₅ by PABLO analysis 
(splinted ligation of non-treated total RNA with RNAI₋₅-specific adaptor). 
Carrying out the respective control (in vitro transcribed monophosphate-³²P- 
RNAI₋₅, spiked into total RNA samples) (f), we determined the percentage 
of ligation product, and normalized to ligation efficiencies to obtain the 
percentage of RNAI₋₅ (g). With the help of the values determined in d-g, 
we then determined the percentage of FL-RNAI and the percentage of 
NAD-FL-RNAI (shown in Fig. 4a), using the formulas shown in h. Values 
shown in this figure represent replicate 1. (Expected) ligation products are 
marked with arrows (b-g). i, Strains used in this study. 
Extended Data Table 1 | Listing of all enriched high-abundance sequences from three different NAD captureSeq experiments 

RNA  Data set: 1 2 3  Comments (e.g., function of sRNA or encoded protein) 
Enriched sRNAs (with real-time PCR validation) 
RNAI + 0 + Only in JM109; copy control RNA of pUC19. (AS) 
CopA + 0 + Only in JM109; copy control RNA of F-plasmid, RNase III mediated regulation. (AS) 
GadY + + + Involved in acid stress response, regulates stability and RNase III degradation of transcripts. (AS) 
GcvB + + + Involved in acid stress response, regulates ABC transporters on translational level. (AS) 
McaS + + + [Found as IS061 (non-strand-specific), due to unclear annotation.] Involved in biofilms. Regulates translation and binds CsrA. (AS) 
ChiX + + + Involved in acid stress response, regulates porin ChiP and DpiA/DpiB two-component system. (AS) 


Enriched sRNAs (without real-time PCR validation) 


CsrC + (+) + Binds to and regulates protein CsrA. 
DsrA + (+) + Acid stress response, low-temperature expression of RpoS; regulates target stability & translation. (AS) 
GlmY (+) + + (Tke1); binds to RapZ protein, thereby regulates GlmZ, which regulates GlmS. 
MgrR + + + (STnc560); induced at low Mg2+ concentration; involved in resistance against polymyxin B. 
RNAII + 0 + Pattern suggests RNAII could be enriched due to pairing with RNAI. Replication primer in RNAI-RNAII system of pUC19. (AS) 
SibC - + + (QUAD) Repression of small toxic protein IbsC. (AS) 
SibD + + + (QUAD) Repression of small toxic protein IbsD. (AS) 
SibE + + + (QUAD) Repression of small toxic protein IbsE. (AS) 
SraC + (+) ag (RyeA); involved in acid stress response; RNase III induced cleavage of RyeB. (AS) 
Enriched 5'-termini (with real-time PCR validation) 
hisL + + (+) His leader peptide; attenuates transcription of hisGDCBHAFI operon (enzymes for histidine biosynthesis). 
rppH + (+) + RNA pyrophosphohydrolase; removes pyrophosphate, targets RNA for RNase E degradation. 
pgk + + + Mapped to epd 3'-terminus. Matches pgk transcript annotation (EcoCyc); phosphoglycerate kinase. 
gatY + + + Tagatose-1,6-bisphosphate aldolase; part of the galactitol catabolism pathway. 
hdeD + + + Acid resistance membrane protein. 
ilvL + + + Ilv leader peptide; attenuates transcription of ilvLGMEDA operon (enzymes for isoleucine and valine biosynthesis). 
yfgG + + + Predicted protein. 
trmJ + + + tRNA Um32 methyltransferase/tRNA Cm32 methyltransferase. 
uspE + + + Universal stress protein E. Implication in UV-stress, also in other stresses. 
Enriched 5'-termini (without real-time PCR validation) 
acpP + + + Subunit of holo-acyl carrier protein; important role in fatty acid biosynthesis; one of the most abundant proteins in E. coli. 
aspA - + + Aspartate ammonia lyase; catalyses the amination of fumarate to yield aspartate in pH-dependent manner. 
rai a a Fs Stationary phase translation inhibitor and ribosome stability factor; association with 70S ribosome in stationary phase; stabilization of 70S ribosome; inhibits translation initiation and elongation in response to stress. 
ssrS + + + Mapped to zapA 3'-end. Matches EcoCyc annotation of longer 6S transcript, processed by RNase E to mature 6S RNA. 
bsmA # (+) * Involved in biofilm stress and motility. Mutants are less resistant to acid and peroxide stress. 
cycA + (+) + Serine/alanine/glycine/cycloserine:H+ symporter; regulated by GcvB on translation level. 
exuR af: (+) + DNA-binding transcription repressor of synthesis of exuR, galacturonate & glucuronate catabolism & transport genes. 
glpA + (+) + Glycerol-3-phosphate dehydrogenase (anaerobic), large subunit; binds FAD. 
glpD + (+) + Glycerol-3-phosphate dehydrogenase (aerobic); binds FAD. 
mgsA (+) + + Methylglyoxal synthase. 
nagC + + + Mapped to nagA 3'-end; NagC: DNA-binding dual transcriptional regulator of aminosugar biosynthesis. 
ihfA (+) + + Mapped to pheT 3'-end; IHF: integration host factor (α subunit); IHF important in gene expression and DNA bending. 
poxA + (+) + Does not match annotated transcription start (5' of it); EF-P-lysine lysyltransferase; aminoacylates EF-P at Lys34. 
racR + (+) + Rac prophage; predicted DNA-binding transcriptional regulator. 
rlmE + + (+) 23S rRNA 2'-O-ribose U2552 methyltransferase. 
thrS Bs (+) as Threonyl-tRNA synthetase. 
traY Bs 0 : Only in JM109: protein encoded on F-plasmid; promotes nicking at oriT. 
ucpA + (+) + Predicted oxidoreductase; predicted to bind NAD. 
ycgE + (+) ag BluR DNA-binding transcriptional repressor; represses genes involved in biofilm formation & acid resistance. 
hB + + + Conserved inner membrane protein. 
Enriched sequences with unclear annotation (without real-time PCR validation) 
aroC? - _ " Mapped to yfcB 3'-end (protein glutamine eee AroC: chorismate synthase; binds flavin'®'; no transcript 
cpdA? # (+) + Mapped to yqiB 3'-end (predicted dehydrogenase); CpdA: cAMP phosphodiesterase; no transcript annotated in EcoCyc. 
eno? + + + Mapped to pyrG 3'-end (CTP synthase); does not match EcoCyc transcript annotation of eno. Enolase, part of degradosome. 
glgC? + (+) + Mapped to glgX 3'-end (debranching enzyme); does not match transcript annotation of glgC. ADP-Glc PPase. 
rpmJ? + + + Mapped to secY 3'-end (involved in protein secretion); no transcript annotation for rpmJ. 50S ribosomal subunit protein L36. 
rseP? a (+) + Mapped to cdsA 3'-end (CDP-diglyceride synthetase); does not match transcript annotation of rseP. Signal peptide peptidase. 
sfsA? 4 a . Starts 3' of sfsA 5'-end (predicted DNA-binding transcriptional regulator of maltose metabolism); 
does not match transcript annotation of dksA. RNA polymerase-binding transcription factor. 
thrB? : ee G2 en Mapped to thrA 3'-end (NAD(P)H-dependent kinase/dehydrogenase); no transcript annotation for thrB. Homoserine kinase. 
ytcO? os + (+) Mapped to yfcO 3'-end (predicted protein, unknown function); could be 5'-end of yfcB (see aroC?). 
Examples of non-enriched genes (with real-time PCR validation) 
5S - - - 5S ribosomal RNA. 
tRNA(Ser) 2 2 Serine tRNA. 
CsrB - - - Involved in acid stress response; binds to and regulates CsrA. 
CyaR - - (-) Down-regulates e.g., outer membrane protein OmpX and NAD synthetase NadE by regulating translation. (AS) 
MicF - (-) (-) Down-regulates porin OmpF by inhibiting translation and destabilizing transcript in response to diverse stresses. (AS) 
Examples of non-enriched genes (without real-time PCR) 
16S - - - 16S ribosomal RNA. 
23S - - - 23S ribosomal RNA. 
6S - - - Mature 6S; regulates transcription (stat. phase), repressing transcription from certain σ70- and σS-dependent promoters. 
tmRNA - - - Transfer-messenger RNA; helps release stalled ribosomes from mRNAs that lack a stop codon or when the appropriate tRNA is lacking. 
RyfD - - - Mapped to clpB. Matches transcription start annotated on EcoCyc. An abundant non-coding sRNA of unknown function. 


Enriched sequences were found to be enriched in at least two out of three data sets. +, sequence is enriched and above the abundance threshold T; (+), sequence is enriched but below T; -, sequence is not 
enriched and above T; (-), sequence is below T and not enriched; 0, sequence not found in data set (not expressed in strain); (AS), sRNA acts via antisense mechanism. Names in parentheses are alternative 
names, as found in the annotation used for GeneProf analysis. If sequences were mapped to a different gene than the one found or suspected to be the true gene (often the case for 5'-ends), the gene to which they were mapped 
is given in Comments. References 43-128 are cited in this table. 
CORRECTIONS & AMENDMENTS 


ERRATUM 
doi:10.1038/nature14220 


Erratum: Antarctic glaciation 
caused ocean circulation changes at 
the Eocene-Oligocene transition 

A. Goldner, N. Herold & M. Huber 


Nature 511, 574-577 (2014); doi:10.1038/nature13597 


In this Letter, the Fig. 2a legend should state “Ocean surface temperature 
anomalies” and the Fig. 2b legend should state “Zonally averaged 
ocean temperature anomaly”, rather than vice versa. This has been 
corrected in the online versions of the paper. 


378 | NATURE | VOL 519 | 19 MARCH 2015 
©2015 Macmillan Publishers Limited. All rights reserved 


ERRATUM 
doi:10.1038/nature14271 


Erratum: Human intracellular 
ISG15 prevents interferon-α/β 
over-amplification and 
auto-inflammation 


Xianqin Zhang, Dusan Bogunovic, Beatrice Payelle-Brogard, 
Véronique Francois-Newton, Scott D. Speer, Chao Yuan, 
Stefano Volpi, Zhi Li, Ozden Sanal, Davood Mansouri, 
Ilhan Tezcan, Gillian I. Rice, Chunyuan Chen, Nahal Mansouri, 
Seyed Alireza Mahdaviani, Yuval Itan, Bertrand Boisson, 
Satoshi Okada, Lu Zeng, Xing Wang, Hui Jiang, Wenqiang Liu, 
Tiantian Han, Delin Liu, Tao Ma, Bo Wang, Mugen Liu, 
Jing-Yu Liu, Qing K. Wang, Dilek Yalnizoglu, 
Lilliana Radoshevich, Gilles Uzé, Philippe Gros, 
Flore Rozenberg, Shen-Ying Zhang, Emmanuelle Jouanguy, 
Jacinta Bustamante, Adolfo Garcia-Sastre, Laurent Abel, 
Pierre Lebon, Luigi D. Notarangelo, Yanick J. Crow, 
Stephanie Boisson-Dupuis, Jean-Laurent Casanova 
& Sandra Pellegrini 


Nature 517, 89-93 (2015); doi:10.1038/nature13801 


In this Letter, author J.-L.C. was inadvertently missing one of their 
affiliations; they should also have been associated with affiliation num- 
ber 2 (St. Giles Laboratory of Human Genetics of Infectious Diseases, 
Rockefeller Branch, The Rockefeller University, New York, New York 
10065, USA). This has been corrected in the online versions of the paper. 